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TWO HUMAN NSP-LTKE PROTEINS 

The present invention relates to nucleic acid and amino acid 
sequences of two novel human NSP-like proteins and to the use of these 
sequences in the diagnosis, study, prevention and treatment of disease. 

BACKGROUND ART 

Neuroendocrine-specific proteins (NSP-A, NSP-B, and NSP-C) are a 
recently characterized group of membrane-anchored endoplasmic reticulum 
(ER) proteins that share identical carboxy-terminal amino acid sequences 
[van de Velde H J et al (1994) J Cell Sci 107:2403-2416). Evidence 
suggests that NSP-A and NSP-C expression is restricted to neuronal and 
endocrine cell populations (van de Velde, supra). Immunohistochemical 
studies showed that rat NSP-A is expressed throughout the rat brain (van 
de Velde HJ et al (1994) Mol Brain Res 23:81-92). NSP-B, however, is 
found only in a small cell lung carcinoma cell line and probably 
represents an aberrant NSP gene product (Roebroek AJ et al (1993) J Biol 
Chem 268:13439-13447). A previously reported neuronally expressed rat 
gene, CI-13, and two partially sequenced human cDNAs (GI 391043 and GI 
894620), have a high degree of homology to NSPs which suggests that NSPs 
belong to a larger family of proteins (Wieczorek DF et al (1991) Mol 
Brain Res 10:33-41; Bell GI et al (1993) Hum Mol Genet 2:1793-798; 
Martin-Galla A et al (1992) Nat Genet 1:34-39). 

Two large hydrophobic regions characterize the NSPs and homologous 
proteins and suggest membrane association. In fact, immunofluorescence 
and biochemical studies have established an association between NSPs and 
membranes of the ER (Senden NH et al (1994) Eur J Cell Biol 65:341-353). 
Analysis of NSP-A deletion mutants indicates that the carboxy-terminal 
hydrophobic region is necessary for membrane binding (van de Velde et al, 
supra). Carboxy-terminal amino acid sequences of the NSPs are highly 
homologous, although they are not a perfect match to a consensus motif 
sufficient for retention of transmembrane proteins in the ER (van de 
Velde, supra; Jackson MR et al (1993) J Cell Biol 121:317-333). Thus, it 
appears likely that NSPs and related proteins are targeted to the ER by 
conserved carboxy-terminal amino acids. 

Immunostaining with anti-NSP-A antibodies suggests that NSP-A may 
be associated with both the rough and smooth neuronal ER. On the basis 
of this evidence and knowledge of neuronal ER function, van de Velde et 
al (1994; supra) conclude that NSPs may be involved in the protein 
transport process or in the regulation of intracellular calcium levels in 
neuronal cells. 
HSF-liko Protoina and Di.o.to 

Dysfunction of ER-mediated neuronal protein transport may 
contribute to neurodegenerative diseases. For example, in amyotrophic 
lateral sclerosis (ALS) , a degenerative disease of motor neurons, 
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position of n urofilam nts in neuronal axons leads to dramatic defects in 
ER-mediated ax nal transport of a variety of proteins (Collard JF et al 
(1995) Nature 375:61-64). Defects in protein transport have been further 
implicated in the pathogenesis of ALS by a transgenic mouse study in 
which ALS is modeled by a mutation in superoxide dismutase (SOD) . SOD 
mutant animals displayed clinical and pathological features of human ALS 
and showed axonal transport defects associated with dilation of the ER 
(Mourelatos Z et al (1996) Proc Natl Acad Sci 93:5472-5477). 

Analysis of specimens of a wide variety of primary human tumors 
show that NSP-A and NSP-C are expressed in small cell lung carcinoma, 
carcinoid tumors of the lung, but not in non-neuroendocrine non-small 
cell lung carcinomas (van de Velde et al (1994) Cancer Res 54:4769-4776). 
Furthermore, antibodies generated to small-cell lung carcinoma surface 
antigens recognize NSP-A, NSP-B, and NSP-C. Therefore, NSPs may act as 
markers in human lung cancer diagnosis and provide an avenue for 
corrective treatment (Senden NH et al (1994) Int J Cancer Suppl 8:84-88). 

New NSP-like proteins could satisfy a need in the art by providing 
new means of diagnosing and treating cancer and neurodegenerative 
disorders such as ALS. 

DISCLOSURE OF THE INVENTION 

The present invention discloses two novel human NSP-like proteins 
(hereinafter referred to individually as NSPLPA and NSPLPB, and 
collectively as NSPLP), characterized as having homology to human NSP-A 
(GI 307307), NSP-B (GI 307309), NSP-C (GI 307311), and rat CI-13 (GI 
281046) . Accordingly, the invention features two substantially purified 
NSP-liJce proteins, as shown in amino acid sequence of SEQ ID NO:l and SEQ 
ID NO: 3, and having characteristics of NSPs. 

One aspect of the invention features isolated and substantially 
purified polynucleotides which encode NSPLP. In a particular aspect, the 
polynucleotide is the nucleotide sequence of SEQ ID NO:2 or SEQ ID NO:4. 
In addition, the invention features polynucleotide sequences that 
hybridize under stringent conditions to SEQ ID NO: 2 or SEQ ID NO: 4. 

The invention further relates to nucleic acid sequences encoding 
NSPLP, oligonucleotides, peptide nucleic acids (PNA), fragments, portions 
or antisense molecules thereof, and expression vectors and host cells 
comprising polynucleotides which encode NSPLP. The present invention 
also relates to antibodies which bind specifically to NSPLP, 
pharmaceutical compositions comprising substantially purified NSPLP, 
fragments thereof, or antagonists of NSPLP, in conjunction with a 
suitable pharmaceutical carrier, and methods for producing NSPLP, 
fragments thereof, or antagonists of NSPLP. 

BRIEF DESCRIPTION OF DRAWINGS 
Figures 1A, IB and 1C show the amino acid sequence (SEQ ID NO:l) 
and nucleic acid sequence (SEQ ID NO: 2) of the novel NSP-like protein, 
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NSPLPA. The alignment was produced using MacDNAsis software (Hitachi 
Software Engin ering Co Ltd) . 

Figures 2A, 2B and 2C show the amino acid sequence (SEQ ID NO: 3) 
and nucleic acid sequence (SEQ ID NO: 4) of the novel NSP-like protein, 
5 NSPLPB (MacDNAsis software, Hitachi Software Engineering Co Ltd) . 

Figures 3A, 3B, 3C, 3D and 3E show the northern analysis for the 
consensus sequence (SEQ ID N0:4). The northern analysis was produced 
electronically using LIFESEQ m database (Incyte Pharmaceuticals, Palo Alto 
CA) . 

10 Figures 4A, 4B and 4C show the northern analysis for Incyte Clones 

31870 (SEQ ID NO: 2) (LIFESEQ BI database, Incyte Pharmaceuticals, Palo Alto 
CA) . 

Figure 5 shows the assembly for the consensus sequence (SEQ ID 

N0:2) . 

15 Figures 6A, 6B, 6C, 6D, 6E and 6F show the amino acid sequence 

alignments among NSPLPA (SEQ ID NO:l), NSPLPB (SEQ ID NO:3), NSP-A (GI 
307307; SEQ ID NO: 5), NSP-B (GI 307309; SEQ ID NO: 6), NSP-C (GI 307311); 
SEQ ID NO:7), and rat CI-13 (GI 281046 SEQ ID NO:8) produced using the 
multisequence alignment program of DNAStar software (DNAStar Inc, Madison 

20 HI). 

Figure 7 shows the hydrophobicity plot (generated using MacDNAsis 
software) for NSPLPA, SEQ ID NO:l; the X axis reflects amino acid 
position, and the negative Y axis, hydrophobicity (Figs. 7, 8, and 9) . 
Figure 8 shows the hydrophobicity plot for NSPLPB, SEQ ID NO: 3. 
25 Figure 9 shows the hydrophobicity plot for NSP-C, SEQ ID NO: 7. 

MODES FOR CARRYING OUT THE INVENTION 

Defiaitiona 

"Nucleic acid sequence" as used herein refers to an 
oligonucleotide, nucleotide or polynucleotide, and fragments or portions 

30 thereof, and to DNA or RNA of genomic or synthetic origin which may be 

single- or double-stranded, and represent the sense or antisense strand. 
Similarly, amino acid sequence as used herein refers to peptide or 
protein sequence. 

"Peptide nucleic acid" as used herein refers to a molecule which 

35 comprises an oligomer to which an amino acid residue, such as lysine, and 

an amino group have been added. These small molecules, also designated 
anti-gene agents, stop transcript elongation by binding to their 
complementary (template) strand of nucleic acid (Nielsen PE et al (1993) 
Anticancer Drug Des 8:53-63). 

40 As used herein, NSPLP refers to the amino acid sequences of 

substantially purified NSPLP obtained from any species, particularly 
mammalian, including bovine, vine, p rcine, murin , equine, and 
preferably human, from any source whether natural, synthetic, 
semi-synthetic or recombinant. 
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A "variant" of NSPLP is defined as an amino acid sequence that is 
altered by one or more amino acids. The variant may have "conservative" 
chang s, wherein a substituted amino acid has similar structural or 
chemical properties, eg, replacement of leucine with isoleucine. More 
rarely, a variant may have "nonconservative" changes, eg, replacement of 
a glycine with a tryptophan. Similar minor variations may also include 
amino acid deletions or insertions, or both. Guidance in determining 
which and how many amino acid residues may be substituted, inserted or 
deleted without abolishing biological or immunological activity may be 
found using computer programs well known in the art, for example, DNAStar 
software. 

A "deletion" is defined as a change in either amino acid or 
nucleotide sequence in which one or more amino acid or nucleotide 
residues, respectively, are absent. 

An "insertion" or "addition" is that change in an amino acid or 
nucleotide sequence which has resulted in the addition of one or more 
amino acid or nucleotide residues, respectively, as compared to the 
naturally occurring NSPLP. 

A "substitution" results from the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, 
respectively. 

The term "biologically active" refers to a NSPLP having structural, 
regulatory or biochemical functions of a naturally occurring NSPLP. 
Likewise, "immunologically active" defines the capability of the natural, 
recombinant or synthetic NSPLP, or any oligopeptide thereof, to induce a 
specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The term "derivative" as used herein refers to the chemical 
modification of a nucleic acid encoding NSPLP or the encoded NSPLP. 
Illustrative of such modifications would be replacement of hydrogen by an 
alkyl, acyl, or amino group. A nucleic acid derivative would encode a 
polypeptide which retains essential biological characteristics of natural 
NSPLP. 

As used herein, the term "substantially purified" refers to 
molecules, either nucleic or amino acid sequences, that are removed from 
their natural environment, isolated or separated, and are at least 60% 
free, preferably 75% free, and most preferably 90% free from other 
components with which they are naturally associated. 

"Stringency" typically occurs in a range from about Tm-5°C (5 C C 
below the Tm of the probe) to about 20°C to 25°C below Tra. As will be 
understood by those of skill in the art, a stringency hybridization can 
be used to identify or detect identical polynucleotide sequences or to 
identify or detect similar or relat d polynucleotide sequences. 

Th term "hybridization" as used herein shall include "any process 
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by which a strand of nucleic acid joins with a compl mentary strand 
through base pairing" (Coombs J {1994] Dictionary ojt Biotechnology . 
Stockton Press, New York NY) . Amplification as carried out in the 
polymerase chain reaction technologies is described in Dieffenbach CW and 
5 GS Dveksler (1995, £££ Primer, a. Laboratory M&Ilil&l, Cold Spring Harbor 

Press, Plainview NY) . 
Proforrod Enbodiaonts 

The present invention relates to novel NSPLP and to the use of the 
nucleic acid and amino acid sequences in the study, diagnosis, prevention 

10 and treatment of disease. cDNAs encoding a portion of NSPLP were found 

in neuronal and endocrine tissue-derived cDNA libraries and in a variety 
of other tissues, including many types of tumors (Figs. 3A-3E and 4A-4C) . 

The present invention also encompasses NSPLP variants. A preferred 
NSPLP variant is one having at least 80% amino acid sequence similarity 

15 to the NSPLP amino acid sequence (SEQ ID NO:l), a more preferred NSPLP 

variant is one having at least 90% amino acid sequence similarity to SEQ 
ID NO:l and a most preferred NSPLP variant is one having at least 95% 
amino acid sequence similarity to SEQ ID NO:l. 

Nucleic acids encoding the human NSPLP of the present invention 

20 were first identified in cDNA, Incyte Clones 31870 (SEQ ID NO: 4; THP-1 

cell cDNA library, THP1NOB01) and 28742 (SEQ ID NO: 9; fetal spleen cDNA 
library, SPLNFET01), through a computer-generated search for amino acid 
sequence alignments. A consensus sequence, SEQ ID NO: 2, was derived from 
the following overlapping nucleic acid sequences: Incyte Clones 2874 2 

25 (from cDNA library SPLNFET01); 45022, 45074, and 45509 (CORNNOT01) ; 

121581 (MUSCNOT01); 570122 (MMLR3DT01); and 754150 (BRATUT02; Fig. 5). 
The nucleic acid sequence of SEQ ID NO: 2 encodes the NSPLPA amino acid 
sequence, SEQ ID NO:l. The nucleic acid sequence of SEQ ID NO: 4 encodes 
the NSPLPB amino acid sequence, SEQ ID NO: 3. The nucleic acid sequence 

30 of SEQ ID NO: 4 from residue C,„ to T T09 has 97% identity to the partial 

cDNA sequence of clone hbc043 (GI 39104; Bell et al, supra). 

The present invention is based, in part, on the chemical and 
structural homology among NSPLPA, NSPLPB, NSP-A (GI 307307; Roebroek et 
al, supra), NSP-B (GI 307309; Roebroek et al, supra), NSP-C (GI 307311; 

35 Roebroek et al, supra), and rat CI-13 (GI 281046; Wieczorek et al, supra; 

Figs. 6A-D). NSPLPA and NSP-C share 66% identity, NSPLPB and NSP-C share 
48% identity, while NSPLPA and NSPLPB share 50% identity. As illustrated 
by Figures 7, 8, and 9, NSPLPA, NSPLPB, and NSP-C have similar 
hydrophobicity plots suggesting similar structure. Like the NSPs, NSPLPA 

40 and NSPLPB have two large hydrophobic regions that could be used for 

membrane attachment. The carboxy- terminal amino acids Lys l9S through Lys !9 , 
of NSPLPA precisely match, in positi n as well as sequence, an ER 
retention motif defined by Jackson et al (1993; supra). The novel NSPLPA 
is 199 amino acids long and has one potential N glycosylation site. The 
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novel NSPLPB is 241 amino acids long. 

Th«» WSPL.P Coding 9flOUMCfl» 

The nucleic acid and deduc d amino acid sequences of NSPLP are 
shown in Figures 1A, IB, 1C, 2A, 2B and 2C. In accordance with the 
invention, any nucleic acid sequence which encodes the amino acid 
sequence of NSPLP can be used to generate recombinant molecules which 
express NSPLP. In a specific embodiment described herein, a nucleotide 
sequence encoding a portion of NSPLP was first isolated as Incyte Clones 
31870 from a THP-1 cell cDNA library (THP1NOB01) . While, Incyte Clone 
28742 was first isolated from a fetal spleen cDNA library (SPLNFET01) . 

It will be appreciated by those skilled in the art that as a result 
of the degeneracy of the genetic code, a multitude of NSPLP-encoding 
nucleotide sequences, some bearing minimal homology to the nucleotide 
sequences of any known and naturally occurring gene may be produced. The 
invention contemplates each and every possible variation of nucleotide 
sequence that could be made by selecting combinations based on possible 
codon choices. These combinations are made in accordance with the 
standard triplet genetic code as applied to the nucleotide sequence of 
naturally occurring NSPLP, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode NSPLP and its variants 
are preferably capable of hybridizing to the nucleotide sequence of the 
naturally occurring NSPLP under appropriately selected conditions of 
stringency, it may be advantageous to produce nucleotide sequences 
encoding NSPLP or its derivatives possessing a substantially different 
codon usage. Codons may be selected to increase the rate at which 
expression of the peptide occurs in a particular prokaryotic or 
eukaryotic expression host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding NSPLP and its 
derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring 
sequence. 

It is now possible to produce a DNA sequence, or portions thereof, 
encoding a NSPLP and its derivatives entirely by synthetic chemistry, 
after which the synthetic gene may be inserted into any of the many 
available DNA vectors and cell systems using reagents that are well known 
in the art at the time of the filing of this application. Moreover, 
synthetic chemistry may be used to introduce mutations into a sequence 
encoding NSPLP or any portion thereof. 

Also included within the scope of the present invention are 
polynucleotide sequences that are capable of hybridizing to the 
nucleotide sequences of Figures 1A, IB, 1C, 2A, 2B, and 2C under various 
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conditions of stringency. Hybridization conditions are based on the 
melting temperature <Tm) of the nucleic acid binding complex or probe, as 
taught in Berger and Kimmel (1987 # Guide to Molecular Cloning 
Technigues . Methods in Enzymology. Vol 152, Academic Press, San Diego CA) 
incorporated herein by reference, and confer may be used at a defined 
stringency. 

Altered nucleic acid sequences encoding NSPLP which may be used in 
accordance with the invention include deletions, insertions or 
substitutions of different nucleotides resulting in a polynucleotide that 
encodes the same or a functionally equivalent NSPLP. The protein may 
also show deletions, insertions or substitutions of amino acid residues 
which produce a silent change and result in a functionally equivalent 
NSPLP. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues as long as 
the biological activity of NSPLP is retained. For example, negatively 
charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids include lysine and arginine; and amino acids with 
uncharged polar head groups having similar hydrophilicity values include 
leucine, isoleucine, valine; glycine, alanine; asparagine, glutamine; 
serine, threonine phenylalanine, and tyrosine. 

Included within the scope of the present invention are alleles of 
NSPLP. As used herein, an "allele" or "allelic sequence" is an 
alternative form of NSPLP. Alleles result from a mutation, ie, a change 
in the nucleic acid sequence, and generally produce altered mRNAs or 
polypeptides whose structure or function may or may not be altered. Any 
given gene may have none, one or many allelic forms. Common mutational 
changes which give rise to alleles are generally ascribed to natural 
deletions, additions or substitutions of amino acids. Each of these 
types of changes may occur alone, or in combination with the others, one 
or more times in a given sequence. 

Methods for DNA sequencing are well known in the art and employ 
such enzymes as the Klenow fragment of DNA polymerase I, Sequenase® (US 
Biochemical Corp, Cleveland OH)), Taq polymerase (Perkin Elmer, Norwalk 
CT), thermostable T7 polymerase (Amersham, Chicago IL), or combinations 
of recombinant polymerases and proofreading exonucleases such as the 
E LONG AS E Amplification System marketed by Gibco BRL (Gaithersburg MD) . 
Preferably, the process is automated with machines such as the Hamilton 
Micro Lab 2200 (Hamilton, Reno NV) , Peltier Thermal Cycler (PTC200; MJ 
Research, Watertown MA) and the ABI 377 DNA sequencers fPerkin Elmer) . 
Extending th» Polvnuelootidft Saqiianca 

The polynucleotide sequence encoding NSPLP may be extended 
utilizing partial nucleotide sequence and various methods known in the 
art to detect upstream sequences such as promoters and regulatory 
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elements. Gobinda et al (1993; PCR Methods Applic 2:318-22) disclose 
"restriction-site" polymerase chain reaction (PCR) as a direct method 
which uses universal primers to retrieve unknown sequence adjacent to a 
known locus. First, genomic DNA is amplified in the presence of primer 
to a linker sequence and a primer specific to the known region. The 
amplified sequences are subjected to a second round of PCR with the same 
linker primer and another specific primer internal to the first one. 
Products of each round of PCR are transcribed with an appropriate RNA 
polymerase and sequenced using reverse transcriptase. 

Inverse PCR can be used to amplify or extend sequences using 
divergent primers based on a known region (Triglia T et al (1988) Nucleic 
Acids Res 16:8186). The primers may be designed using OLIGO® 4.06 Primer 
Analysis Software (1992; National Biosciences Inc, Plymouth MN) , or 
another appropriate program, to be 22-30 nucleotides in length, to have a 
GC content of 50% or more, and to anneal to the target sequence at 
temperatures about 68 # -72° C. The method uses several restriction 
enzymes to generate a suitable fragment in the known region of a gene. 
The fragment is then circularized by intramolecular ligation and used as 
a PCR template. 

Capture PCR (Lagerstrom M et al (1991) PCR Methods Applic 1:111-19) 
is a method for PCR amplification of DNA fragments adjacent to a known 
sequence in human and yeast artificial chromosome DNA. Capture PCR also 
requires multiple restriction enzyme digestions and ligations to place an 
engineered double-stranded sequence into an unknown portion of the DNA 
molecule before PCR. 

Another method which may be used to retrieve unknown sequences is 
that of Parker JD et al (1991; Nucleic Acids Res 19:3055-60). 
Additionally, one can use PCR, nested primers and PromoterFinder 
libraries to walk in genomic DNA (PromoterFinder™ Clontech (Palo Alto 
CA) . This process avoids the need to screen libraries and is useful in 
finding intron/exon junctions. 

Preferred libraries for screening for full length cDNAs are ones 
that have been size-selected to include larger cDNAs . Also, random 
primed libraries are preferred in that they will contain more sequences 
which contain the 5' and upstream regions of genes. A randomly primed 
library may be particularly useful if an oligo d(T> library does not 
yield a full-length cDNA. Genomic libraries are useful for extension 
into the 5' nontranslated regulatory region. 

Capillary electrophoresis may be used to analyze the size or 
confirm the nucleotide sequence of sequencing or PCR products. Systems 
for rapid sequencing are available from Perkin Elmer, Beckman Instruments 
(Fullerton CA) , and other companies. Capillary sequencing may employ 
flowabl polymers for electrophoretic separation, four different 
fluorescent dyes (one for each nucleotide) which are laser activated, and 
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detection of the emitted wavelengths by a charge coupled devise camera. 
Output/light intensity is converted to el ctrical signal using 
appropriate software (eg. Genotyper** and Sequence Navigator™ from Perkin 
Elm r) and the entire process from loading of samples to computer 
analysis and electronic data display is computer controlled. Capillary 
electrophoresis is particularly suited to the sequencing of small pieces 
of DNA which might be present in limited amounts in a particular sample. 
The reproducible sequencing of up to 350 bp of M13 phage DNA in 30 min 
has been reported (Ruiz-Martinez MC et al (1993) Anal Chem 65:2851-2858). 
Eroraaalon of the Nuclaotlda S ft mi an g a 

In accordance with the present invention, polynucleotide sequences 
which encode NSPLP, fragments of the polypeptide, fusion proteins or 
functional equivalents thereof may be used in recombinant DNA molecules 
that direct the expression of NSPLP in appropriate host cells. Due to 
the inherent degeneracy of the genetic code, other DNA sequences which 
encode substantially the same or a functionally equivalent amino acid 
sequence, may be used to clone and express NSPLP. As will be understood 
by those of skill in the art, it may be advantageous to produce 
NSPLP-encoding nucleotide sequences possessing non-naturally occurring 
codons. Codons preferred by a particular prokaryotic or eukaryotic host 
(Murray E et al (1989) Nuc Acids Res 17:477-508) can be selected, for 
example, to increase the rate of NSPLP expression or to produce 
recombinant RNA transcripts having desirable properties, such as a longer 
half-life, than transcripts produced from naturally occurring sequence. 

The nucleotide sequences of the present invention can be engineered 
in order to alter a NSPLP coding sequence for a variety of reasons, 
including but not limited to, alterations which modify the cloning, 
processing and/or expression of the gene product. For example, mutations 
may be introduced using techniques which are well known in the art, eg, 
site-directed mutagenesis to insert new restriction sites, to alter 
glycosylation patterns, to change codon preference, to produce splice 
variants, etc. 

In another embodiment of the invention, a natural, modified or 
recombinant polynucleotides encoding NSPLP may be ligated to a 
heterologous sequence to encode a fusion protein. For example, for 
screening of peptide libraries for inhibitors of NSPLP activity, it may 
be useful to encode a chimeric NSPLP protein that is recognized by a 
commercially available antibody. A fusion protein may also be engineered 
to contain a cleavage site located between a NSPLP sequence and the 
heterologous protein sequence, so that the NSPLP may be cleaved and 
purified away from the heterologous moiety. 

In an alternate embodiment of the invention, the coding sequence of 
NSPLP may be synthesized, whole or in part, using chemical methods well 
known in the art (see Caruthers MH et al (1980) Nuc Acids Res Syrap Ser 
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215-23, Horn T et al{1980) Nuc Acids Res Symp Ser 225-32, etc). 
Alternatively, the protein its If could be produced using chemical 
ra thods to synth size a NSPLP amino acid sequence, whole or in part. For 
example, peptide synthesis can be performed using various solid-phase 
5 techniques (Roberge JY et al (1995) Science 269:202-204) and automated 

synthesis may be achieved, for example, using the ABI 4 31A Peptide 
Synthesizer (Perkin Elmer) in accordance with the instructions provided 
by the manufacturer. 

The newly synthesized peptide can be substantially by preparative 
10 high performance liquid chromatography (eg, Creighton (1983) Proteins . 

Structures aad MPlCCUlar Prinnin1«»s. WH Freeman and Co, New York NY) . 
The composition of the synthetic peptides may be confirmed by amino acid 
analysis or sequencing (eg, the Edraan degradation procedure; Creighton, 
supra). Additionally the amino acid sequence of NSPLP, or any part 
thereof, may be altered during direct synthesis and/or combined using 
chemical methods with sequences from other proteins, or any part thereof, 
to produce a variant polypeptide. 

In order to express a biologically active NSPLP, the nucleotide 
sequence encoding NSPLP or its functional equivalent, is inserted into an 
appropriate expression vectcr, ie, a vector which contains the necessary 
elements for the transcription and translation of the inserted coding 
sequence . 

Methods which are well known to those skilled in the art can be 
25 U3ed to construct expression vectors containing a NSPLP coding sequence 

and appropriate transcriptional or translational controls. These methods 
include in vitrp recombinant DNA techniques, synthetic techniques and in 
Vi v 9 recombination or genetic recombination. Such techniques are 
described in Sambrook et al (1989) Molecular Cloning . & Laboratory 
30 Manual* Cold Spring Harbor Press, Plainview NY and Ausubel FM et al 

(1989) Current PrPtPCols in Molecular Biology . John Wiley & Sons, New 
York NY. 

A variety of expression vector/host systems may be utilized to 
contain and express a NSPLP coding sequence. These include but are not 
35 limited to microorganisms such as bacteria transformed with recombinant 

bacteriophage, plasmid or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected 
with virus expression vectors (eg, baculovirus) ; plant cell systems 
transfected with virus expression vectors (eg, cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with bacterial expression 
vectors {eg, Ti or pBR322 plasmid); or animal cell systems. 

The "control elements" or "regulatory sequences" of these systems 
vary in their strength and specificities and are those nontranslated 
regions of the vector, enhancers, promoters, and 3' untranslated regions, 
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which interact with host cellular proteins to carry out transcription and 
translation. Depending on the vector system and host utilized, any 
number of suitable transcription and translation elements, including 
constitutive and inducible promoters, may be used. For example, when 
cloning in bacterial systems, inducible promoters such as the hybrid lacZ 
promoter of the Bluescript® phagemid (Stratagene, LaJolla CA) or pSportl 
(Gibco BRL) and ptrp-lac hybrids and the like may be used. The 
baculovirus polyhedrin promoter may be used in insect cells. Promoters 
or enhancers derived from the genomes of plant cells (eg, heat shock, 
RUBISCO; and storage protein genes) or from plant viruses (eg, viral 
promoters or leader sequences) may be cloned into the vector. In 
mammalian cell systems, promoters from the mammalian genes or from 
mammalian viruses are most appropriate. If it is necessary to generate a 
cell line that contains multiple copies of NSPLP, vectors based on SV40 
or EBV may be used with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be 
selected depending upon the use intended for NSPLP. For example, when 
large quantities of NSPLP are needed for the induction of antibodies, 
vectors which direct high level expression of fusion proteins that are 
readily purified may be desirable. Such vectors include, but are not 
limited to, the multifunctional £. coli cloning and expression vectors 
such as Bluescript® (Stratagene) , in which the NSPLP coding sequence may 
be ligated into the vector in frame with sequences for the amino-terminal 
Met and the subsequent 7 residues of Q-galactosidase so that a hybrid 
protein is produced; pIN vectors (Van Heeke & Schuster (1989) J Biol Chem 
264:5503-5509); and the like. pGEX vectors (Promega, Madison WI) may 
also be used to express foreign polypeptides as fusion proteins with 
glutathione S- transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to 
glutathione-agarose beads followed by elution in the presence of free 
glutathione. Proteins made in such systems are designed to include 
heparin, thrombin or factor XA protease cleavage sites so that the cloned 
polypeptide of interest can be released from the GST moiety at will. 

In the yeast, Sflccharpm.ycfi a cerevisiae. a number of vectors 
containing constitutive or inducible promoters such as alpha factor, 
alcohol oxidase and PGH may be used. For reviews, see Ausubel et ai 
(supra) and Grant et al (1987) Methods in Enzymology 153:516-544. 

In cases where plant expression vectors are used, the expression of 
a sequence encoding NSPLP may be driven by any of a number of promoters. 
For example, viral promoters such as the 35S and 19S promoters of CaMV 
(Brisson et al (1984) Nature 310:511-514) may be used alone or in 
combination with th omega leader sequence from TMV (Takamatsu et al 
(1987) EMBO J 6:307-311). Alternatively, plant promoters such as the 
small subunit of RUBISCO (Coruzzi et al (1984) EMBO J 3:1671-1680; 
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Broglie et al (1984) Science 224:838-843); or heat shock promoters 
(Winter J and Sinibaldi RM (1991) Results Probl Cell Differ 17:B5-105) 
may be used. These constructs can be introduced into plant cells by 
direct DNA transformation or pathogen-mediated transf ection . For reviews 
of such techniques, see Hobbs S or Murry LE in McGraw Hill Yearbook q£ 
Science And Technology (1992) McGraw Hill New York NY, pp 191-196 or 
Weissbach and Weissbach (1988) Methods for Plant Molecular Biology . 
Academic Press, New York NY, pp 421-463. 

An alternative expression system which could be used to express 
NSPLP is an insect system. In one such system, Autoarapha California 
nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign 
genes in SPQdPPtera fruqiperdfl cells or in Trlchoplusia larvae. The 
NSPLP coding sequence may be cloned into a nonessential region of the 
virus, such as the polyhedrin gene, and placed under control of the 
poiyhedrin promoter. Successful insertion of NSPLP will render the 
polyhedrin gene inactive and produce recombinant virus lacking coat 
protein coat. The recombinant viruses are then used to infect £. 
fruoioerda cells or Trichoplusia larvae in which NSPLP is expressed 
(Smith et al (1983) J Virol 46:584; Engelhard EK et al (1994) Proc Nat 
Acad Sci 91:3224-7) . 

In mammalian host cells, a number of viral-based expression systems 
may be utilized. In cases where an adenovirus is used as an expression 
vector, a NSPLP coding sequence may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and 
tripartite leader sequence. Insertion in a nonessential El or E3 region 
of the viral genome will result in a viable virus capable of expressing 
NSPLP in infected host cells (Logan and Shenk (1984) Proc Natl Acad Sci 
81:3655-59). In addition, transcription enhancers, such as the rous 
sarcoma virus (RSV) enhancer, may be used to increase expression in 
mammalian host cells. 

Specific initiation signals may also be required for efficient 
translation of a NSPLP sequence. These signals include the ATG 
initiation codon and adjacent sequences. In cases where NSPLP, its 
initiation codon and upstream sequences are inserted into the appropriate 
expression vector, no additional translational control signals may be 
needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous transcriptional control signals including 
the ATG initiation codon must be provided. Furthermore, the initiation 
codon must be in the correct reading frame to ensure transcription of the 
entire insert. Exogenous transcriptional elements and initiation codons 
can be of various origins, both natural and synthetic. The efficiency of 
expression may be enhanced by the inclusion of enhancers appropriate to 
the cell system in use (Scharf D et al (1994) Results Probl Cell Differ 
20:125-62; Bittner et al (1987) Methods in Enzymol 153:516-54 4). 
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In addition, a host: cell strain may be chosen for its ability to 
modulate the expression of the inserted sequences or to process the 
expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosyiation, phosphorylation, lipidation and acylation. 
Post-translationai processing which cleaves a "prepro" form of the 
protein may also be important for correct insertion, folding and/or 
function. Different host cells such as CHO, HeLa, MDCK, 293, WI38, etc 
have specific cellular machinery and characteristic mechanisms for such 
post-translational activities and may be chosen to ensure the correct 
modification and processing of the introduced, foreign protein. 

Tor long-term, high-yield production of recombinant proteins, 
stable expression is preferred. For example, cell lines which stably 
express NSPLP may be transformed using expression vectors which contain 
viral origins of replication or endogenous expression elements and a 
selectable marker gene. Following the introduction of the vector, cells 
may be allowed to grow for 1-2 days in an enriched media before they are 
switched to selective media. The purpose of the selectable marker is to 
confer resistance to selection, and its presence allows growth and 
recovery of cells which successfully express the introduced sequences. 
Resistant clumps of stably transformed cells can be proliferated using 
tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex 
virus thymidine kinase (Wigler M et al (1977) Cell 11:223-32) and adenine 
phosphoribosyltransferase (Lowy I et al (1980) Ceil 22:817-23) genes 
which can be employed in tk- or aprt- cells, respectively. Also, 
antimetabolite, antibiotic or herbicide resistance can be used as the 
basis for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler M et al (1980) Proc Natl Acad Sci 77:3567-70); npt, 
which confers resistance to the aminoglycosides neomycin and G-418 
(Colbere-Garapin F et al (1981) J Mol Biol 150:1-14) and als or pat, 
which confer resistance to chlorsulfuron and phosphinotricin 
acetyltransferase, respectively (Murry, supra). Additional selectable 
genes have been described, for example, trpB, which allows cells to 
utilize indole in place of tryptophan, or hisD, which allows cells to 
utilize histinol in place of histidine (Hartman SC and RC Mulligan (1988) 
Proc Natl Acad Sci 85:8047-51). Recently, the use of visible markers has 
gained popularity with such markers as anthocyanins, fi glucuronidase and 
its substrate, GUS, and luciferase and its substrate, luciferin, being 
widely used not only to identify transformants, but also to quantify the 
amount of transient or stable protein expression attributable to a 
specific vector system (Rhodes CA et al (1995) Methods Mol Biol 
55:121-131) . 
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Idonfcificafclon of Trmnaf oraanti Containing th« Polvnucl«otido Scmimcn 

Although the presence/absence of marker gene expression suggests 
that the gene f interest is also present, its presence and expression 
should be confirmed. For example, if the NSPLP is inserted within a 
marker gene sequence, recombinant cells containing NSPLP can be 
identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a NSPLP sequence under the 
control of a single promoter. Expression of the marker gene in response 
to induction or selection usually indicates expression of the tandem 
NSPLP as well. 

Alternatively, host cells which contain the coding sequence for 
NSPLP and express NSPLP may be identified by a variety of procedures 
known to those of skill in the art. These procedures include, but are 
not limited to, DNA-DNA or DNA-RNA hybridization and protein bioassay or 
immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of the nucleic acid 
or protein. 

The presence of the polynucleotide sequence encoding NSPLP can be 
detected by DNA-DNA or DNA-RNA hybridization or amplification using 
probes, portions or fragments of polynucleotides encoding NSPLP. Nucleic 
acid amplification based assays involve the use of oligonucleotides or 
oligomers based on the NSPLP-encoding sequence to detect transf ormants 
containing DNA or RNA encoding NSPLP. As used herein "oligonucleotides" 
or "oligomers" refer to a nucleic acid sequence of at least about 10 
nucleotides and as many as about 60 nucleotides, preferably about 15 to 
30 nucleotides, and more preferably about 20-25 nucleotides which can be 
used as a probe or amplimer. 

A variety of protocols for detecting and measuring the expression 
of NSPLP, using either polyclonal or monoclonal antibodies specific for 
the protein are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA) , radioimmunoassay (RIA) and fluorescent 
activated cell sorting (FACS) . A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two non-interfering epitopes 
on NSPLP is preferred, but a competitive binding assay may be employed. 
These and other assays are described, among other places, in Hampton R et 

ai (1990, Serological Methods, a Laboratory Manual, aps Press, st Paul 

MN) and Maddox DE et al (1983, J Exp Med 158:1211). 

A wide variety of labels and conjugation techniques are known by 
those skilled in the art and can be used in various nucleic acid and 
amino acid assays. Means for producing labeled hybridization or PCR 
probes for detecting sequences related to polynucleotides encoding NSPLP 
include oligolabeling, nick translation, end-labeling or PCR 
amplification using a labeled nucleotide. Alternatively, the NSPLP 
sequence, or any portion of it, may be cloned into a vector for the 
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production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitrn 
by addition of an appropriate RNA polymerase such as T7, T3 or SP6 and 
labeled nucleotides. 

A number of companies such as Pharmacia Biotech {Piscataway NJ) , 
Promega (Madison WI) P and US Biochemical Corp (Cleveland OH) supply 
commercial kits and protocols for these procedures. Suitable reporter 
molecules or labels include those radionuclides, enzymes, fluorescent, 
chemiluminescent, or chromogenic agents as well as substrates, cof actors, 
inhibitors, magnetic particles and the like. Patents teaching the use of 
such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149 and 4,366,241. Also, recombinant 
immunoglobulins may be produced as shown in US Patent No. 4,616,567 
incorporated herein by reference. 
Purification of HSfLP 

Host cells transformed with a nucleotide sequence encoding NSPLP 
may be cultured under conditions suitable for the expression and recovery 
of the encoded protein from cell culture. The protein produced by a 
recombinant cell may be secreted or contained intracellularly depending 
on the sequence and/or the vector used. As will be understood by those 
of skill in the art, expression vectors containing polynucleotides 
encoding NSPLP can be designed with signal sequences which direct 
secretion of NSPLP through a prokaryotic or eukaryotic cell membrane. 
Other recombinant constructions may join NSPLP to nucleotide sequence 
encoding a polypeptide domain which will facilitate purification of 
soluble proteins (Kroll DJ et al (1993) DNA Cell Biol 12:441-53; cf 
discussion of vectors infra containing fusion proteins) . 

NSPLP may also be expressed as a recombinant protein with one or 
more additional polypeptide domains added to facilitate protein 
purification. Such purification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tryptophan 
modules that allow purification on immobilized metals, protein A domains 
that allow purification on immobilized immunoglobulin, and the domain 
utilized in the FLAGS extension/affinity purification system (Immunex 
Corp, Seattle WA) . The inclusion of a cleavable linker sequences such as 
Factor XA or enterokinase (Invitrogen, San Diego CA) between the 
purification domain and NSPLP is useful to facilitate purification. One 
such expression vector provides for expression of a fusion protein 
compromising an NSPLP and contains nucleic acid encoding 6 histidine 
residues followed by thioredoxin and an enterokinase cleavage site. The 
histidine residues facilitate purification on IMIAC (immobilized metal 
ion affinity chromotography as described in Porath et al (1992) Protein 
Expression and Purification 3: 263-281) while the enterokinase cleavage 
site provides a means for purifying NSPLP from the fusion protein. 
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In addition to recombinant production, fragments of NSPLP may be 
produced by direct peptide synthesis using solid-phase techniques (cf 
Stewart et ai (1969) Solid- Phase Peptide Synthesis . WH Freeman Co, San 
Francisco; Merrifield J (1963) J Am Chem Soc 85:2149-2154). Xa vitro 
protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be achieved, for example, using 
Applied Biosystems 431A Peptide Synthesizer (Perkin Elmer, Foster City 
CA) in accordance with the instructions provided by the manufacturer. 
Various fragments of NSPLP may be chemically synthesized separately and 
combined using chemical methods to produce the full length molecule. 
U«q« of NSPLP 

The rationale for use of the nucleotide and polypeptide sequences 
disclosed herein is based in part on the chemical and structural homology 
among the novel NSPLP proteins disclosed herein, NSP-A (GI 307307; 
Roebroek et al, supra), NSP-B (GI 307309; Roebroek et al, supra), NSP-C 
(GI 307311; Roebroek et al, supra), and rat CI-13 (GI 281046; Wieczorek 
et al, supra) . 

Accordingly, NSPLP or a NSPLP derivative may be used to treat 
cancer and neurodegenerative disorders, such as ALS. In those conditions 
where NSPLP protein activity is not desirable, cells could be transfected 
with antisense sequences of NSPLP-encoding polynucleotides or provided 
with antagonists of NSPLP* 
NSPLP Anfcifarvii— 

NSPLP-specif ic antibodies are useful for the diagnosis of 
conditions and diseases associated with expression of NSPLP. Such 
antibodies may include, but are not limited to, polyclonal, monoclonal, 
chimeric, single chain, Fab fragments and fragments produced by a Fab 
expression library. Neutralizing antibodies, ie, those which inhibit 
dimer formation, are especially preferred for diagnostics and 
therapeutics. 

NSPLP for antibody induction does not require biological activity; 
however, the protein fragment, or oligopeptide must be antigenic. 
Peptides used to induce specific antibodies may have an amino acid 
sequence consisting of at least five amino acids, preferably at least 10 
amino acids. Preferably, they should mimic a portion of the amino acid 
sequence of the natural protein and may contain the entire amino acid 
sequence of a small, naturally occurring molecule. Short stretches of 
NSPLP amino acids may be fused with those of another protein such as 
keyhole limpet hemocyanin and antibody produced against the chimeric 
molecule. Procedures well known in the art can be used for the 
production of antibodies to NSPLP. 

For the production of antibodies, various hosts including goats, 
rabbits, rats, mice, etc may be immunized by injection with NSPLP or any 
portion, fragment or oligopeptide which retains immunogenic properties. 
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Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include but are not limited to, 
Freund's, mineral gels such as aluminum hydroxide, and surface active 
substanc s such as lysolecithin, pluronic polyols, polyanions, peptides, 
oil emulsions, keyhole limpet hemocyanin, and dinitrophenol . BCG 
{bacilli Calmette-Guerin) and Corynebacterium parvum are potentially 
useful human adjuvants. 

Monoclonal antibodies to NSPLP may be prepared using any technique 
which provides for the production of antibody molecules by continuous 
cell lines in culture. These include but are not limited to the 
hybridoma technique originally described by Koehler and Milstein (1975 
Nature 256:495-497), the human B-cell hybridoma technique (Kosbor et al 
(1983) Immunol Today 4:72; Cote et al (1983) Proc Natl Acad Sci 
80:2026-2030) and the EBV-hybridoraa technique (Cole et al (1985) 
Monoclonal Antibodies and Cancer Therapy. Alan R Liss inc, New York ny, 

pp 77-96). 

In addition, techniques developed for the production of "chimeric 
antibodies", the splicing of mouse antibody genes to human antibody genes 
to obtain a molecule with appropriate antigen specificity and biological 
activity can be used (Morrison et al (1984) Proc Natl Acad Sci 
81:6851-6855; Neuberger et al (1984) Nature 312:604-608; Takeda et al 
(1985) Nature 314:452-454). Alternatively, techniques described for the 
production of single chain antibodies (OS Patent No. 4,946,778) can be 
adapted to produce NSPLP-specif ic single chain antibodies 

Antibodies may also be produced by inducing jjx vivo production in 
the lymphocyte population or by screening recombinant immunoglobulin 
libraries or panels of highly specific binding reagents as disclosed in 
Orlandi et al (1989, Proc Natl Acad Sci 86: 3833-3837) r and Winter G and 
Milstein C (1991; Nature 349:293-299). 

Antibody fragments which contain specific binding sites for NSPLP 
may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(ab')2 fragments. 
Alternatively, Fab expression libraries may be constructed to allow rapid 
and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse WD et al (1989) Science 256:1275-1281). 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 
specificities are well known in the art. Such immunoassays typically 
involve the formation of complexes between NSPLP and its specific 
antibody and the measurement of complex formation. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to 
two noninterfering epitopes on a specific NSPLP protein is preferred, but 
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a competitive binding assay may also be employed. These assays are 
described in Maddox DC et al (1983, J Exp Med 158:1211). 
Diagnostic Aaiavi Vaina NSPLP flp^cif ig AnfrihmHjm 

Particular NSPLP antibodies are useful for the diagnosis of 
5 conditions or diseases characterized by expression of NSPLP or in assays 

to monitor patients being treated with NSPLP, agonists or inhibitors. 
Diagnostic assays for NSPLP include methods utilizing the antibody and a 
label to detect NSPLP in human body fluids or extracts of cells or 
tissues. The polypeptides and antibodies of the present invention may be 

10 used with or without modification. Frequently, the polypeptides and 

antibodies will be labeled by joining them, either covalently or 
noncovalently, with a reporter molecule. A wide variety of reporter 
molecules are known, several of which were described above. 

A variety of protocols for measuring NSPLP, using either polyclonal 

15 or monoclonal antibodies specific for the respective protein are known in 

the art. Examples include entyme-linked immunosorbent assay (ELISA) # 
radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS) . A 
two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering epitopes on NSPLP is preferred, but a 

20 competitive binding assay may be employed. These assays are described, 

among other places, in Maddox, DE et al (1983, J Exp Med 158:1211). 

In order to provide a basis for diagnosis, normal or standard 
values for NSPLP expression must be established. This is accomplished by 
combining body fluids or cell extracts taken from normal subjects, either 

25 animal or human, with antibody to NSPLP under conditions suitable for 

complex formation which are well known in the art. The amount of 
standard complex formation may be quantified by comparing various 
artificial membranes containing known quantities of NSPLP with both 
control and disease samples from biopsied tissues. Then, standard values 

30 obtained from normal samples may be compared with values obtained from 

samples from subjects potentially affected by disease. Deviation between 
standard and subject values establishes the presence of disease state. 

Prag Bortaning 

NSPLP, its catalytic or immunogenic fragments or oligopeptides 
thereof, can be used for screening therapeutic compounds in any of a 
variety of drug screening techniques. The fragment employed in such a 
test may be free in solution, affixed to a solid support, borne on a eel* 
surface, or located intracellular^ . The formation of binding complexes, 
between NSPLP and the agent being tested, may be measured. 

Another technique for drug screening which may be used provides for 
high throughput screening of compounds having suitable binding affinity 
to the NSPLP is described in detail in "Determination of Amino Acid 
Sequence Antigenicity" by Geysen HN, WO Application 84/03564, published 
on September 13, 1984, and incorporated herein by reference. In summary, 



35 



40 



18 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06S41 PCT/US97/13469 

large numbers of different small peptide test compounds ar synthesized 
on a solid substrate, such as plastic pins or some other surface. The 
peptide test compounds are reacted with fragments of NSPLP and washed* 
Bound NSPLP is then detected by methods well known in the art. Purified 
NSPLP can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and 
immobilize it on a solid support. 

This invention also contemplates the use of competitive drug 
screening assays in which neutralizing antibodies capable of binding 
NSPLP specifically compete with a test compound for binding NSPLP. In 
this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with NSPLP. 
Uses of the Polynucleotide Encoding NSPLP 

A polynucleotide encoding NSPLP, or any part thereof, may be used 
for diagnostic and/or therapeutic purposes. For diagnostic purposes, 
polynucleotides encoding NSPLP of this invention may be used to detect 
and quantitate gene expression in biopsied tissues in which expression of 
NSPLP may be implicated. The diagnostic assay is useful to distinguish 
between absence, presence, and excess expression of NSPLP and to monitor 
regulation of NSPLP levels during therapeutic intervention. Included in 
the scope of the invention are oligonucleotide sequences, antisense RNA 
and DNA molecules, and PHAs. 

Another aspect of the subject invention is to provide for 
hybridization or PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding NSPLP or closely related 
molecules. The specificity of the probe, whether it is made from a 
highly specific region, eg, 10 unique nucleotides in the 5* regulatory 
region, or a less specific region, eg, especially in the 3' region, and 
the stringency of the hybridization or amplification (maximal, high, 
intermediate or low) will determine whether the probe identifies only 
naturally occurring sequences encoding NSPLP, alleles or related 
sequences . 

Probes may also be used for the detection of related sequences and 
should preferably contain at least 50% of the nucleotides from any of 
these NSPLP encoding sequences. The hybridization probes of the subject 
invention may be derived from the nucleotide sequence of SEQ ID NO: 2 or 
from genomic sequence including promoter, enhancer elements and introns 
of the naturally occurring NSPLP. Hybridization probes may be labeled by 
a variety of reporter groups, including radionuclides such as 32P or 35S, 
or enzymatic labels such as alkaline phosphatase coupled to the probe via 
avidin/biotin coupling systems, and the like. 

Other means for producing specific hybridization probes for DNAs 
encoding NSPLP include the cloning of nucleic acid sequences encoding 
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NSPLP or NSPLP derivatives into vectors for the production of mRNA 
probes. Such vectors are known in the art and are commercially available 
and may be used to synthesize RNA probes ill vitro by means of the 
addition of the appropriate RNA polymerase as T7 or SP6 RNA polymerase 
and the appropriate radioactively labeled nucleotides. 

Polynucleotide sequences encoding NSPLP may be used for the 
diagnosis of conditions or diseases with which the expression of NSPLP is 
associated. For example, polynucleotide sequences encoding NSPLP may be 
used in hybridization or PCR assays of fluids or tissues from biopsies to 
detect NSPLP expression. The form of such qualitative or quantitative 
methods may include Southern or northern analysis, dot blot or other 
membrane-based technologies; PCR technologies; dip stick, pIN, chip and 
ELISA technologies. All of these techniques are well known in the art 
and are the basis of many commercially available diagnostic kits. 

The nucleotide sequences encoding NSPLP disclosed herein provide 
the basis for assays that detect activation or induction associated with 
cancer and neurodegenerative disorders, such as ALS. The nucleotide 
sequence encoding NSPLP may be labeled by methods known in the art and 
added to a fluid or tissue sample from a patient under conditions 
suitable for the formation of hybridization complexes. After an 
incubation period, the sample is washed with a compatible fluid which 
optionally contains a dye (or other label requiring a developer) if the 
nucleotide has been labeled with an enzyme. After the compatible fluid 
is rinsed off, the dye is quantitated and compared with a standard. If 
the amount of dye in the biopsied or extracted sample is significantly 
elevated over that of a comparable control sample, the nucleotide 
sequence has hybridized with nucleotide sequences in the sample, and the 
presence of elevated levels of nucleotide sequences encoding NSPLP in the 
sample indicates the presence of the associated disease. 

Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regime in animal studies, in clinical 
trials, or in monitoring the treatment of an individual patient. In 
order to provide a basis for the diagnosis of disease, a normal or 
standard profile for NSPLP expression must be established. This is 
accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with NSPLP, or a portion thereof, under 
conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained for 
normal subjects with a dilution series of NSPLP run in the same 
experiment where a known amount of a substantially purified NSPLP is 
used. Standard values obtained from normal samples may be compared with 
values obtained from samples from patients afflicted with 
NSPLP-associated diseases. Deviation between standard and subject values 
is used to establish the presence of disease. 
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Once disease is established, a therapeutic agent is administered 
and a treatment profile is generated. Such assays may be repeated on a 
regular basis to evaluate whether the values in the profil progr ss 
toward or r turn to the normal or standard pattern. Successive treatment 
profiles may be used to show the efficacy of treatment over a period of 
several days or several months. 

PCR, as described in US Patent Nos. 4, 683, 195 and 4, 965, 188, 
provides additional uses for oligonucleotides based upon the NSPLP 
sequence. Such oligomers are generally chemically synthesized, but they 
may be generated enzymatically or produced from a recombinant source. 
Oligomers generally comprise two nucleotide sequences, one with sense 
orientation (5'->3') and one with antisense (3'<-5'), employed under 
optimized conditions for identification of a specific gene or condition. 
The same two oligomers, nested sets of oligomers, or even a degenerate 
pool of oligomers may be employed under less stringent conditions for 
detection and/or quantitation of closely related DNA or RNA sequences. 

Additionally, methods which may be used to quantitate the 
expression of a particular molecule include radiolabeling (Melby PC et al 
1993 J Immunol Methods 159:235-44) or biotinylating (Duplaa C et al 1993 
Anal Biochem 229-36) nucleotides, coamplif ication of a control nucleic 
acid, and standard curves onto which the experimental results are 
interpolated. Quantitation of multiple samples may be speeded up by 
running the assay in an ELISA format where the oligomer of interest is 
presented in various dilutions and a spectre-photometric or colorimetric 
response gives rapid quantitation. For example, the presence of a 
relatively high amount of NSPLP in extracts of biopsied tissues may 
indicate the onset of cancer. A definitive diagnosis of this type may 
allow health professionals to begin aggressive treatment and prevent 
further worsening of the condition. Similarly, further assays can be 
used to monitor the progress of a patient during treatment. Furthermore, 
the nucleotide sequences disclosed herein may be used in molecular 
biology techniques that have not yet been developed, provided the new 
techniques rely on properties of nucleotide sequences that are currently 
known such as the triplet genetic code, specific base pair interactions, 
and the like. 
Th«r*PQUfcie Vma 

Based upon its homology to genes encoding NSP-like proteins and its 
expression profile, polynucleotide sequences encoding NSPLP disclosed 
herein may be useful in the treatment of conditions such as cancer and 
neurodegenerative disorders, such as ALS. 

Expression vectors derived from retroviruses, adenovirus, herpes or 
vaccinia viruses, or from various bact rial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue or cell 
population. Methods which are well known to those skilled in the art can 
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be used to construct recombinant vectors which will express antisense 
polynucleotides of the gen encoding NSPLP. See, for example, the 
techniques described in Sarabrook et al (supra) and Ausubel et al (supra) . 

The p lynucleotides comprising full length cDNA sequence and/or its 
regulatory elements enable researchers to use sequences encoding NSPLP as 
an investigative tool in sense (Youssoufian H and HF' Lodish 1993 Mol Cell 
Biol 13:98-104) or antisense (Eguchi et al (1991) Annu Rev Biochem 
60:631-652) regulation of gene function. Such technology is now well 
known in the art, and sense or antisense oligomers, or larger fragments, 
can be designed from various locations along the coding or control 
regions. 

Genes encoding NSPLP can be turned off by transfecting a cell or 
tissue with expression vectors which express high levels of a desired 
NSPLP-encoding fragment. Such constructs can flood cells with 
untranslatable sense or antisense sequences. Even in the absence of 
integration, into the DNA, such vectors may continue to transcribe RNA 
molecules until ail copies are disabled by endogenous nucleases. 
Transient expression may last for a month or more with a non-replicating 
vector (Mettler I, personal communication) and even longer if appropriate 
replication elements are part of the vector system. 

As mentioned above, modifications of gene expression can be 
obtained by designing antisense molecules, DNA, RNA or PNA, to the 
control regions of gene encoding NSPLP, ie, the promoters, enhancers, and 
introns. Oligonucleotides derived from the transcription initiation 
site, eg, between -10 and +10 regions of the leader sequence, are 
preferred. The antisense molecules may also be designed to block 
translation of mRNA by preventing the transcript from binding to 
ribosomes. Similarly, inhibition can be achieved using "triple helix" 
base-pairing methodology. Triple helix pairing compromises the ability 
of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA were reviewed by Gee JE et al (In: Huber BE 
and BI Carr (1994) Molecular and Immunologic A pproaches . Futura 
Publishing Co, Mt Kisco NY) . 

Ribozymes are enzymatic RNA molecules capable of catalyzing the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Within the scope of 
the invention are engineered hammerhead motif ribozyme molecules that can 
specifically and efficiently catalyze endonucleolytic cleavage of 
sequences encoding NSPLP. 

Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
cleavage sites which include the following sequences, GUA, GUU and GUC. 
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Once identified, short RNA sequences of b tween 15 and 20 ribonucleotides 
corresponding to the region of the target gen containing the cleavage 
site may be evaluated for secondary structural features which may render 
the oligonucleotide inoperable. The suitability of candidat targets may 
also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays. 

Antisense molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of RNA molecules. These 
include techniques for chemically synthesizing oligonucleotides such as 
solid phase phosphoramidite chemical synthesis. Alternatively, RNA 
molecules may be generated by ia vitro and in. vivo transcription of DNA 
sequences encoding NSPLP. Such DNA sequences may be incorporated into a 
wide variety of vectors with suitable RNA polymerase promoters such as T7 
or SP6. Alternatively, antisense cDNA constructs that synthesize 
antisense RNA constitutively or inducibly can be introduced into cell 
lines, cells or tissues. 

RNA molecules may be modified to increase intracellular stability 
and half-life. Possible modifications include, but are not limited to, 
the, addition of flanking sequences at the 5* -and/or 3 1 ends of the 
molecule or the use of phosphorothioate or 2* O-methyl rather than 
phosphodiesterase linkages within the backbone of the molecule. This 
concept is inherent in the production of PNAs and can be extended in all 
of these molecules by the inclusion of nontraditional bases such as 
inosine, queosine and wybutosine as well as acetyl-, methyl-, thio- and 
similarly modified forms of adenine, cytidine, guanine, thymine, and 
uridine which are not as easily recognized by endogenous endonucleases . 

Methods for introducing vectors into cells or tissues include those 
methods discussed infra and which are equally suitable for Ixi vivo - in 
Yitro and vivo therapy. For vivo therapy, vectors are introduced 
into stem cells taken from the patient and clonally propagated for 
autologous transplant back into that same patient is presented in US 
Patent Nos. 5,399,493 and 5,437,994, disclosed herein by reference. 
Delivery by transfection and by liposome are quite well known in the art. 

Furthermore, the nucleotide sequences for NSPLP disclosed herein 
may be used in molecular biology techniques that have not yet been 
developed, provided the new techniques rely on properties of nucleotide 
sequences that are currently known, including but not limited to such 
properties as the triplet genetic code and specific base pair 
interactions. 

Detection and Mapping of Related Polynucleo tide Samienefta 

The nucleic acid sequence for NSPLP can also be used to generate 
hybridization pr bes for mapping the naturally occurring genomic 
sequence. The sequence may be mapped to a particular chromosome or to a 
specific region of the chromosome using well known techniques. These 
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include xix 3>tu hybridization to chromosomal spreads, flow-3orted 
chromosomal preparations, or artificial chroraos me constructions such as 
yeast artificial chromosomes, bacterial artificial chromosomes, bacterial 
PI constructions or single chromosome cDNA libraries as reviewed in Price 
CM (1993; Blood Rev 7:127-34) and Trask BJ (1991; Trends Genet 7:149-54). 

The technique of fluorescent in situ hybridization of chromosome 
spreads has been described, among other places, in Verma et al (1988) 

Human Chromosomes: A Manual ol Basic rechnimips. pergamon Press, New 
Vork NY. Fluorescent in situ hybridization of chromosomal preparations 
and other physical chromosome mapping techniques may be correlated with 
additional genetic map data. Examples of genetic map data can be found 
in the 1994 Genome Issue of Science (265:1981f). Correlation between the 
location of the gene encoding NSPLP on a physical chromosomal map and a 
specific disease (or predisposition to a specific disease) may help 
delimit the region of DNA associated with that genetic disease. The 
nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals. 

In situ hybridization of chromosomal preparations and physical 
mapping techniques such as linkage analysis using established chromosomal 
markers may be used for extending genetic maps. For example an sequence 
tagged site based map of the human genome was recently published by the 
Whitehead-MIT Center for Genomic Research (Hudson TJ et al(1995) Science 
270:1945-1954). Often the placement of a gene on the chromosome of 
another mammalian species such as mouse (Whitehead Institute/MIT Center 
for Genome Research, Genetic Map of the Mouse, Database Release 10, April 
28 , 1995) may reveal associated markers even if the number or arm of a 
particular human chromosome is not known. New sequences can be assigned 
to chromosomal arms, or parts thereof, by physical mapping. This 
provides valuable information to investigators searching for disease 
genes using positional cloning or other gene discovery techniques. Once 
a disease or syndrome, such as ataxia telangiectasia (AT), has been 
crudely localized by genetic linkage to a particular genomic region, for 
example, AT to llq22-23 (Gatti et al (1988) Nature 336:577-580), any 
sequences mapping to that area may represent associated or regulatory 
genes for further investigation. The nucleotide sequence of the subject 
invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc. among normal, carrier or 
affected individuals. 
Pharmaceutical Compositions 

The present invention relates to pharmaceutical compositions which 
may comprise nucleotides, proteins, antibodies, agonists, antagonists, or 
inhibitors, alone or in combination with at least one other agent, such 
as stabilizing compound, which may be administered in any sterile, 
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biocompatible pharmaceutical carrier, including, but not limited to, 
saline, buffered saline, dextrose, and water. Any of these molecules can 
be administered to a patient alone, or in combination with other agents, 
drugs or hormones, in pharmaceutical compositions wher it is mixed with 
excipient(s) or pharmaceutically acceptable carriers. In one embodiment 
of the present invention, the pharmaceutically acceptable carrier is 
pharmaceutically inert. 

ftfotinigtrfltign off PhMMCOTttcal CosppaAttong 

Administration of pharmaceutical compositions is accomplished 
orally or parenterally. Methods of parenteral delivery include topical, 
intra-arterial (directly to the tumor), intramuscular, subcutaneous, 
intramedullary, intrathecal, intraventricular, intravenous, 
intraperitoneal, or intranasal administration. In addition to the active 
ingredients, these pharmaceutical compositions may contain suitable 
pharmaceutically acceptable carriers comprising excipients and 
auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. Further details on 
techniques for formulation and administration may be found in the latest 
edition of "Remington' s Pharmaceutical Sciences" (Maack Publishing Co, 
Easton PA) . 

Pharmaceutical compositions for oral administration can be 
formulated using pharmaceutically acceptable carriers well known in the 
art in dosages suitable for oral administration. Such carriers enable 
the pharmaceutical compositions to be formulated as tablets, pills, 
dragees, capsules, liquids, gels, syrups, slurries, suspensions and the 
like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through 
combination of active compounds with solid excipient, optionally grinding 
a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. 
Suitable excipients are carbohydrate or protein fillers such as sugars, 
including lactose, sucrose, mannitol, or sorbitol; starch from corn, 
wheat, rice, potato, or other plants; cellulose such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethyl cellulose; and gums 
including arable and tragacanth; and proteins such as gelatin and 
collagen. If desired, disintegrating or solubilizing agents may be 
added, such as the cross-linked polyvinyl pyrrolidone, agar, alginic 
acid, or a salt thereof, such as sodium alginate. 

Dragee cores are provided with suitable coatings such as 
concentrated sugar solutions, which may also contain gum arabic, talc, 
polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium 
dioxide, lacquer solutions, and suitable organic solvents or solvent 
mixtures. Dyestuffs or pigments may be added to the tablets or dragee 
coatings for product identification or to characterize the quantity of 
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active compound, ie, dosage. 

Pharmaceutical preparations which can be used orally include 
push-fit capsules made of gelatin, as well as soft, sealed capsules made 
of gelatin and a coating such as glycerol or sorbitol. Push-fit capsules 
can contain active ingredients mixed with a filler or binders such as 
lactose or starches, lubricants such as talc or magnesium stearate, and, 
optionally, stabilizers. In soft capsules, the active compounds may be 
dissolved or suspended in suitable liquids, such as fatty oils, liquid 
paraffin, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations for parenteral administration include 
aqueous solutions of active compounds. For injection, the pharmaceutical 
compositions of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as Hanks' s 
solution, Ringer's solution, or physiologically buffered saline. Aqueous 
injection suspensions may contain substances which increase the viscosity 
of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be 
prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Optionally, the suspension may also contain suitable stabilizers or 
agents which increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the 
particular barrier to be permeated are used in the formulation. Such 
penetrants are generally known in the art. 
MantfttQtwra and Storage 

The pharmaceutical compositions of the present invention may be 
manufactured in a manner that known in the art, eg, by means of 
conventional mixing, dissolving, granulating, dragee-making, levigating, 
emulsifying, encapsulating, entrapping or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be 
formed with many acids, including but not limited to hydrochloric, 
sulfuric, acetic, lactic, tartaric, malic, succinic, etc. Salts tend to 
be more soluble in aqueous or other protonic solvents that are the 
corresponding free base forms. In other cases, the preferred preparation 
may be a lyophilized powder in lmM-50 mM histidine, O.lft-2% sucrose, 
2%-7% mannitol at a pH range of 4.5 to 5.5 that is combined with buffer 
prior to use. 

After pharmaceutical compositions comprising a compound of the 
invention formulated in a acceptable carrier have been prepared, they can 
be placed in an appropriate container and labeled for treatment of an 
indicated condition. For administration of NSPLP, such labeling would 
include amount, frequency and method of administration. 
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Therapeutically Effective Dona 

Pharmaceutical compositions suitable for use in the present 
invention include compositi ns wherein the active ingredients are 
contained in an effectiv amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those 
skilled in the art. 

For any compound, the therapeutically effective dose can be 
estimated initially either in cell culture assays, eg, of neoplastic 
cells, or in animal models, usually mice, rabbits, dogs, or pigs. The 
animal model is also used to achieve a desirable concentration range and 
route of administration. Such information can then be used to determine 
useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of protein 
or its antibodies, antagonists, or inhibitors which ameliorate the 
symptoms or condition. Therapeutic efficacy and toxicity of such 
compounds can be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, eg, ED50 (the dose therapeutically 
effective in 50% of the population) and LD50 (the dose lethal to 50% of 
the population) . The dose ratio between therapeutic and toxic effects is 
the therapeutic index, and it can be expressed as the ratio, LD50/ED50. 
Pharmaceutical compositions which exhibit large therapeutic indices are 
preferred. The data obtained from cell culture assays and animal studies 
is used in formulating a range of dosage for human use. The dosage of 
such compounds lies preferably within a range of circulating 
concentrations that include the ED50 with little or no toxicity. The 
dosage varies within this range depending upon the dosage form employed, 
sensitivity of the patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of 
the patient to be treated. Dosage and administration are adjusted to 
provide sufficient levels of the active moiety or to maintain the desired 
effect. Additional factors which may be taken into account include the 
severity of the disease state, eg, tumor size and location; age, weight 
and gender of the patient; diet, time and frequency of administration, 
drug combination (s) , reaction sensitivities, and tolerance/response to 
therapy. Long acting pharmaceutical compositions might be administered 
every 3 to 4 days, every week, or once every two weeks depending on 
half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up 
to a total dose of about 1 g, depending upon the route of administration. 
Guidance as to particular dosages and methods of delivery is provided in 
the literature and generally available to practitioners in the art. 
Those skilled in the art will mploy different formulations for 
nucleotides than for proteins or their inhibitors. Similarly, delivery 
of polynucl otides or polypeptides will be specific to particular cells, 
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conditions, locations, etc. 

It is contemplated, for example, that NSPLP or an NSPLP derivative 
can be delivered in a suitable formulation to block the progression of 
cancerous cell growth or of neuronal degeneration. Similarly, 
administration of NSPLP antagonists may also inhibit the activity or 
shorten the lifespan of this protein. 

The examples below are provided to illustrate the subject invention 
and are not included for the purpose of limiting the invention. 

INDUSTRIAL APPLICABILITY 
I Construction of eDMA Librariaa 

THP-1 

THP-1 is a human leukemic cell line derived from the blood of a 1- 
year-old boy with acute monocytic leukemia. The THP-1 cells represent 
monocytes. The THP-1 cDNA library was custom constructed by Stratagene 
(Stratagene, 11099 M. Torrey Pines Rd., La Jolla, CA 92037) essentially 
as described below. 

Stratagene prepared the cDNA library using oligo d(T) priming. 
Synthetic adapter oligonucleotides were ligated onto the cDNA molecules 
enabling them to be inserted into the Uni-ZAP 1 * vector system 
(Stratagene). This allowed high efficiency unidirectional (sense 
orientation) lambda library construction and the convenience of a plasmid 
system with blue/white color selection to detect clones with cDNA 
insertions. 

The quality of the cDNA library was screened using DNA probes, and 
then, the pBluescript® phagemid (Stratagene) was excised. This phagemid 
allows the use of a plasmid system for easy insert characterization, 
sequencing, site-directed mutagenesis, the creation of unidirectional 
deletions and expression of fusion polypeptides. Subsequently, the 
custom-constructed library phage particles were infected into E. coli 
host strain XLl-Blue® (Stratagene) . The high transformation efficiency 
of this bacterial strain increases the probability that the cDNA library 
will contain rare, under-represented clones. Alternative unidirectional 
vectors include, but are not limited to, pcDNAI (Invitrogen, San Diego 
CA) and pSHlox-1 (Novagen, Madison WI). 

Fatal gpltttn 

The human spleen cell cDNA library was custom constructed by 
Stratagene (catalogue * 937205. Stratagene, La Jolla CA) . The starting 
cell population is mixed, having been obtained from fetal spleens which 
have a diverse cell population. Furthermore, the fetal spleens have been 
pooled from different sources. Poly(A+) RNA (mRNA) was purified from the 
spleen cells. cDNA was synthesized from the mRNA. Synthetic adaptor 
oligonucleotides were ligated onto cDNA ends enabling its insertion into 
Uni-ZAP" vector system (Stratagene), allowing high efficiency 
unidirectional (sense orientation) lambda library construction and the 
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convenience of a plasmid system with blue/white color selection to detect 
clones with cDNA insertions. Alternative unidirectional vectors are 
pcDNAl (invitrogen, San Diego CA) and pSHlox-i (Novagen, Madison WI) . 
II Iaolafcion of cDNA Clones 
THP-1 

The phagemid forms of individual cDNA clones were obtained by the 
in vivo excision process, in which the host bacterial strain was co- 
infected with both the library phage and an fl helper phage. 
Polypeptides or enzymes derived from both the library-containing phage 
and the helper phage nicked the DNA, initiated new DNA synthesis from 
defined sequences on the target DNA, and created a smaller, single 
stranded circular phagemid DNA molecule that included all DNA sequences 
of the pBluescript phagemid and the cDNA insert. The phagemid DNA was 
released from the cells and purified, and used to reinfect fresh host 
cells (SOLR, Stratagene) where double-stranded phagemid DNA was produced. 
Because the phagemid carries the gene for (J-lactamase, the newly 
transformed bacteria were selected on medium containing ampicillin. 

An alternate method of purifying phagemid has recently become 
available. It utilizes the Miniprep Kit (Catalog No. 77468, available 
from Advanced Genetic Technologies Corp., 19212 Orbit Drive, 
Gaithersburg, Maryland) . This kit is in the 96-well format and provides 
enough reagents for 960 purifications. Each kit is provided with a 
recommended protocol, which has been employed except for the following 
changes. First, the 96 wells are each filled with only 1 ml of sterile 
terrific broth with carbenicillin at 25 mg/L and glycerol at 0.4%. After 
the wells are inoculated, the bacteria are cultured for 24 hours and 
lysed with 60 m1 of lysis buffer. A centrif ugation step (2900 rpm for 5 
minutes) is performed before the contents of the block are added to the 
primary filter plate. The optional step of adding isopropanol to TRIS 
buffer is not routinely performed. After the last step in the protocol, 
samples are transferred to a Beckman 96-well block for storage. 

Phagemid DNA was also purified using the QIAWELL-8 Plasmid 

Purification System from the QIAGEN® DNA Purification System (QIAGEN Inc, 

Chatsworth CA) . This product provides a convenient, rapid and reliable 

high -throughput method for lysing the bacterial cells and isolating 

highly purified phagemid DNA using QIAGEN anion-exchange resin particles 

with EMPORE™ membrane technology from 3M in a multiwell format. The DNA 

was eluted from the purification resin and prepared for DNA sequencing 

and other analytical manipulations. 
Fetal aplMn 

The phagemid forms of individual cDNA clones were obtained by the 
in vivo excision process, in which the host bacterial strain was co- 
infected with both the library phage and an fl helper phage. 
Polypeptides or enzymes derived from both the library-containing phage 
and the helper phage nicked the DNA, initiated new DNA synthesis from 
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defined sequences on the target DNA, and created a smaller, single 
stranded circular phagemid DNA molecule that included all DNA sequences 
of the pBluescript phagemid and the cDNA insert. The phagemid DNA was 
r leased from the cells and purified, and used to r infect fresh host 
cells (SOLR, Stratagene) where double-stranded phagemid DNA was produced. 
Because the phagemid carries the gene for (^-lactamase, the newly 
transformed bacteria were selected on medium containing arapicillin. 

Phagemid DNA was also purified using the QIAWELL-8 Plasmid 
Purification System from the QIAGEN® DNA Purification System (QIAGEN Inc, 
Chatsworth CA) . This product provides a convenient, rapid and reliable 
high-throughput method for lysing the bacterial cells and isolating 
highly purified phagemid DNA using QIAGEN anion-exchange resin particles 
with EM PORE™ membrane technology from 3M in a multiwell format. The DNA 
was eluted from the purification resin and prepared for DNA sequencing 
and other analytical manipulations, 

III Homology Searching of cDNA donas and Thair Emdu ggd Prohiin f 

Each cDNA was compared to sequences in GenBank using a search 
algorithm developed by Applied Biosystems and incorporated into the 
INHERIT" 670 Sequence Analysis System. In this algorithm, Pattern 
Specification Language (TRW Inc, Los Angeles CA> was used to determine 
regions of homology. The three parameters that determine how the 
sequence comparisons run were window size, window offset, and error 
tolerance. Using a combination of these three parameters, the DNA 
database was searched for sequences containing regions of homology to the 
query sequence, and the appropriate sequences were scored with an initial 
value. Subsequently, these homologous regions were examined using dot 
matrix homology plots to distinguish regions of homology from chance 
matches. Smith-Waterman alignments were used to display the results of 
the homology search. 

Peptide and protein sequence homologies were ascertained using the 
INHERIT- 670 Sequence Analysis System in a way similar to that used in 
DNA sequence homologies. Pattern Specification Language and parameter 
windows were used to search protein databases for sequences containing 
regions of homology which were scored with an initial value. Dot-matrix 
homology plots were examined to distinguish regions of significant 
homology from chance matches. 

BLAST, which stands for Basic Local Alignment Search Tool (Altschul 
SF (1993) J Mol Evol 36:290-300; Altschul, SF et al (1990) J Mol Biol 
215:403-10), was used to search for local sequence alignments. BLAST 
produces alignments of both nucleotide and amino acid sequences to 
determine sequence similarity. Because of the local nature of the 1 
alignments, BLAST is especially useful in determining exact matches or in 
identifying homologs. BLAST is useful for matches which do not contain 
gaps. The fundamental unit of BLAST algorithm output is the High-scoring 
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Segment Pair (HSP) - 

An HSP consists of two sequence fragments of arbitrary but equal 
lengths whose alignment is locally maximal and for which the alignment 
score meets or exceeds a threshold r cutoff score set by the user. The 
BLAST approach is to look for HSPs between a query sequence and a 
database sequence, to evaluate the statistical significance of any 
matches found, and to report only those matches which satisfy the 
user-selected threshold of significance. The parameter E establishes the 
statistically significant threshold for reporting database sequence 
matches. E is interpreted as the upper bound of the expected frequency 
of chance occurrence of an HSP {or set of HSPs) within the context of the 
entire database search. Any database sequence whose match satisfies E is 
reported in the program output. 
IV Korthern Analvaia 

Northern analysis is a laboratory technique used to detect the 
presence of a transcript of a gene and involves the hybridization of a 
labelled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound (Sambrook et al . supra). 

Analogous computer techniques using BLAST (Altschul SF 1993 and 
1990, supra) are used to search for identical or related molecules in 
nucleotide databases such as GenBank or the LIFESEQf" database (Incyte, 
Palo Alto CA) . This analysis is much faster than multiple, membrane- 
based hybridizations. In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is 
categorized as exact or homologous. 

The basis of the search is the product score which is defined as: 
* sequence Identity x % mayiirmm BLAST score* 
100 

and it takes into acccount both the degree of similarity between two 
sequences and the length of the sequence match. For example, with a 
product score of 40, the match will be exact within a 1-2% error; and at 
70, the match will be exact. Homologous molecules are usually identified 
by selecting those which show product scores between 15 and 40, although 
lower scores may identify related molecules. 

V SXtmaiOfl Of WSPIP-Encodina Polynucleotid e to Full length or fco 
RttCOVT Raaulatorv Eloann^ 

Full length NSPLP-encoding nucleic acid sequence {SEQ ID NO: 2) is 
used to design oligonucleotide primers for extending a partial nucleotide 
sequence to full length or for obtaining 5' sequences from genomic 
libraries. One primer is synthesized to initiate extension in the 
antisense direction (XLR) and the other is synthesized to extend sequence 
in the sense direction (XLF) . Primers allow the extension of the known 
NSPLP-encoding sequence "outward" generating amplicons containing new, 
unknown nucleotide sequence for the region of interest (US Patent 
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Application 08/487, 112, filed June 7, 1995, specifically incorporated by 
reference) . The initial primers are designed from the cDNA using OLIGO* 
4.06 Primer Analysis Software (National Biosciences), or another 
appropriate program, to be 22-30 nucleotides in length, to have a GC 
5 content of 50% or more, and to anneal to the target sequence at 

temperatures about 68°-72° C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations is avoided. 

The original, selected cDNA libraries, or a human genomic library 
are used to extend the sequence; the latter is most useful to obtain 5' 

10 upstream regions. If more extension is necessary or desired, additional 

sets of primers are designed to further extend the known region. 

By following the instructions for the XL-PCR kit (Per kin Elmer) and 
thoroughly mixing the enzyme and reaction mix, high fidelity 
amplification is obtained. Beginning with 40 pmol of each primer and the 

IS recommended concentrations of all other components of the kit, PGR is 

performed using the Peltier Thermal Cycler (PTC200; MJ Research, 

Watertown MA) and the following parameters: 

Step 1 94* C for 1 min (initial denaturation) 

Step 2 65° C for 1 min 

20 Step 3 68° C for 6 min 

Step 4 94° C for 15 sec 

Step 5 65° C for 1 min 

Step 6 68° C for 7 min 

Step 7 Repeat step 4-6 for 15 additional cycles 

25 Step 8 94° C for 15 sec 

Step 9 65" C for 1 min 

Step 10 68° C for 7:15 min 

Step 11 Repeat step 8-10 for 12 cycles 

Step 12 72° C for 8 min 

30 Step 13 4° C (and holding) 

A 5-10 m1 aliquot of the reaction mixture is analyzed by 
electrophoresis on a low concentration (about 0.6-0.8%) agarose raini-gel 
to determine which reactions were successful in extending the sequence. 

35 Bands thought to contain the largest products were selected and cut out 

of the gel. Further purification involves using a commercial gel 
extraction method such as QIAQuick*" (QIAGEN Inc) . After recovery of the 
DNA, Xlenow enzyme was used to trim single-stranded, nucleotide overhangs 
creating blunt ends which facilitate religation and cloning. 

40 After ethanol precipitation, the products are redissolved in 13 /*1 

of ligation buffer, 1^1 T4-DNA ligase (15 units) and 1/il T4 
polynucleotide kinase are added, and the mixture is incubated at room 
temperature for 2-3 hours or overnight at 16° C. Competent En. coli cells 
(in 40 nl of appropriate media) are transformed with 3 ftl of ligation 

45 mixture and cultured in 80 fxl of SOC medium (Sambrook J et al, supra). 

After incubation for one hour at 37° C, the whole transformation mixture 
is plated on Luria Bertani (LB) -agar (Sambrook J et al, supra) containing 
2xCarb. The following day, several colonies are randomly picked from 
each plate and cultured in 150 */l of liquid LB/2xCarb medium placed in an 
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individual well of an appropriate, commercially-available, sterile 96- 
well microtiter plate. The following day, 5 *<1 of each ov might culture 
is transferred into a non-sterile 96-well plate and after dilution 1:10 
with water, 5 Atl of each sample is transferred into a PGR array. 
5 For PCR amplification, 18 pi of concentrated PGR reaction mix 

(3.3x) containing 4 units of rTth DNA polymerase, a vector primer and one 
or both of the gene specific primers used for the extension reaction are 
added to each well. Amplification is performed using the following 
conditions: 

10 Step 1 94° C for 60 sec 

Step 2 94° C for 20 sec 

Step 3 55° C for 30 sec 

Step 4 72° C for 90 sec 

Step 5 Repeat steps 2-4 for an additional 29 cycles 

15 Step 6 72° C for 180 sec 

Step 7 4° C (and holding) 

Aliquots of the PCR reactions are run on agarose gels together 
with molecular weight markers. The sizes of the PCR products are 
compared to the original partial cDNAs, and appropriate clones are 
20 selected, ligated into plasmid and sequenced. 

VI L«bttlino and Uae of Hybridisation Profana 

Hybridization probes derived from SEQ ID NO: 2 are employed to 
screen cONAs , genomic DMAs or mRNAs. Although the labeling of 
oligonucleotides, consisting of about 20 base-pairs, is specifically 
25 described, essentially the same procedure is used with larger cDNA 

fragments. Oligonucleotides are designed using state-of-the-art software 
such as OLIGO 4.06 (National Biosciences), labeled by combining 50 pmol 
of each oligomer and 250 mCi of [v- 12 P] adenosine triphosphate (Amersham, 
Chicago IL) and T4 polynucleotide kinase (DuPont NEN*, Boston MA) . The 
30 . labeled oligonucleotides are substantially purified with Sephadex G-25 

super fine resin column (Pharmacia) . A portion containing 10 7 counts per 
minute of each of the sense and antisense oligonucleotides is used in a 
typical membrane based hybridization analysis of human genomic DNA 
digested with one of the following endonucleases (Ase I, Bgl II, Eco RI, 
35 Pst I, Xba 1, or Pvu II; DuPont NEN*) . 

The DNA from each digest is fractionated on a 0.7 percent agarose 
gel and transferred to nylon membranes (Nytran Plus, Schleicher & 
Schuell, Durham NH) . Hybridization is carried out for 16 hours at 40°C. 
To remove nonspecific signals, blots are sequentially washed at room 
temperature under increasingly stringent conditions up to 0.1 x saline 
sodium citrate and 0.5% sodium dodecyl sulfate. After XOMAT AR W film 
(Kodak, Rochester NY) is exposed to the blots in a Phosphoimager cassette 
(Molecular Dynamics, Sunnyvale CA) for several hours, hybridization 
patterns are compared visually. 



40 



45 



VII Antisenae Moleculaa 
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The NSPLP-encoding sequence, or any part thereof, is used to 
inhibit in vivo or in vitro expression of naturally occurring NSPLP. 
Although use of antis nse oligonucleotides, comprising ab ut 20 base* 
pairs, is specifically described, essentially the same procedure is used 
5 with larger cDNA fragments. An oligonucleotide based on the coding 

sequences of NSPLP, as shown in Figs. 1A, IB, 2A, and 2B is used to 
inhibit expression of naturally occurring NSPLP. The complementary 
oligonucleotide is designed from the most unique 5 * sequence as shown in 
Figures 1A, IB, 2A, and 2B and used either to inhibit transcription by 

10 preventing promoter binding to the upstream nontranslated sequence or 

translation of an NSPLP-encoding transcript by preventing the ribosome 
from binding. Using an appropriate portion of the leader and 5' sequence 
of SEQ ID N0:2, an effective antisense oligonucleotide includes any 15-20 
nucleotides spanning the region which translates into the signal or early 

IS coding sequence of the polypeptide as shown in Figures 1A, IB, 2A, and 

2B. 

Vizi Esprtmpg of NSPLP 

Expression of the NSPLP is accomplished by subcloning the cDNAs 
into appropriate vectors and transfecting the vectors into host cells. 

20 In this case, the cloning vector, pSport, previously used for the 

generation of the cDNA library is used to express NSPLP in £. coli . 
Upstream of the cloning site, this vector contains a promoter for 
ft-galactosidase, followed by sequence containing the amino-terminal Met 
and the subsequent 7 residues of A-galactosidase . Immediately following 

25 these eight residues is a bacteriophage promoter useful for transcription 

and a linker containing a number of unique restriction sites. 

Induction of an isolated, transfected bacterial strain with IPTG 
using standard methods produces a fusion protein which consists of the 
first seven residues of fi-galactosidase, about 5 to IS residues of 

30 linker, and the full length NSPLP-encoding sequence. The signal sequence 

directs the secretion of NSPLP into the bacterial growth media which can 
be used directly in the following assay for activity. 

IX NSPLP Activity 

35 NSPLP' s ER targeting activity can be assessed by a method of van 

de Velde et al (1994, supra). Microsomes are collected from cells 
expressing NSPLP by a 100,000 g spin in a method described by Verboomen H 
et al (1992 Biochem J 286:591-596). After treatment with 0.5 M KC1 and 
centrifugation the pellet is resuspended and subject to gel 

4 0 electrophoresis. Western blot analysis using antibodies to NSPLP reveals 

the presence of NSPLP in the ER membrane. 



Production of WSPLP Specific Antibodies 
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NSPLP substantially purified using PAGE electrophoresis (Sambrook, 
supra) is used to immunize rabbits and to produce antibodies using 
standard protocols. The amino acid sequence translated from NSPLP is 
analyzed using DNAStar software (DNAStar Inc) to determine regions of 
high immunogenicity and a corresponding oligopolypeptide is synthesized 
and used to raise antibodies by means known to those of skill in the art. 
Analysis to select appropriate epitopes, such as those near the 
C-terminus or in hydrophilic regions (shown in Figs. 7 and 8) is 
described by Ausubel FM et al (supra) . 

Typically, the oligopeptides are 15 residues in length, 
synthesized using an Applied Biosystems Peptide Synthesizer Model 4 31A 
using fmoc -chemistry, and coupled to keyhole limpet hemocyanin (KLH, 
Sigma) by reaction with M-roaleimidobenzoyl-N-hydroxysuccinimide ester 
(MBS; Ausubel FM et al, supra) . Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant. The resulting 
antisera are tested for antipeptide activity, for example, by binding the 
peptide to plastic, blocking with 1% BSA, reacting with rabbit antisera, 
washing, and reacting with radioiodinated, goat anti-rabbit IgG. 
XX Purification of Naturally Occurring NSPLP Uainy Specific 

AntibodiM 

Naturally occurring or recombinant NSPLP is substantially purified 
by immunoaf f inity chromatography using antibodies specific for NSPLP. An 
immunoaf finity column is constructed by covalently coupling NSPLP 
antibody to an activated chromatographic resin such as CnBr-activated 
Sepharose (Pharmacia Biotech) . After the coupling, the resin is blocked 
and washed according to the manufacturer's instructions. 

Media containing NSPLP is passed over the immunoaf finity column, 
and the column is washed under conditions that allow the preferential 
absorbance of NSPLP (eg, high ionic strength buffers in the presence of 
detergent) . The column is eluted under conditions that disrupt 
antibody/NSPLP binding (eg, a buffer of pH 2-3 or a high concentration of 
a chaotrope such as urea or thiocyanate ion), and NSPLP is collected. 
XII Identification of MoIqcuIm Which Interact with NSPLP 

NSPLP, or biologically active fragments thereof, are labelled with 
l "l Bolton-Hunter reagent (Bolton, AE and Hunter, WM (1973) Biochem J 133 
529) . Candidate molecules previously arrayed in the wells of a 96 well 
plate are incubated with the labelled NSPLP, washed and any wells with 
labelled NSPLP complex are assayed. Data obtained using different 
concentrations of NSPLP are used to calculate values for the number, 
affinity, and association of NSPLP with the candidate molecules. 

All publications and patents mentioned in the above specification 
are herein incorporated by reference. Various modifications and 
variations of the described method and system of the invention will be 
apparent to those skilled in the art without departing from the scope and 
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spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be unduly limited to such 
specific embodiments. Indeed, various modifications of the described 
5 modes for carrying out the invention which are obvious to those skilled 

in molecular biology or related fields are intended to be within the 
scope of the following claims. 



36 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06841 



PCT/US97/13469 



SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: INCYTE PHARMACEUTICALS , INC. 

(ii) TITLE OF THE INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 

(iii) NUMBER OF SEQUENCES: 9 

<iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 
(C> CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: U.S. 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) PCT APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/700,607 
{B) FILING DATE: AUGUST 12, 1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,74 9 

(C) REFERENCE /DOCKET NUMBER: PF-0114 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: Consensus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



Met Asp Gly 


Gin 


Lys 


Lys 


Asn 


Trp 


Lys 


Asp Lys Val 


Val 


Asp 


Leu 


Leu 


1 




5 










10 






15 




Tyr Trp Arg 


Asp 


lie 


Lys 


Lys 


Thr 


Gly Val Val Phe 


Gly Ala Ser 


Leu 




20 










25 






30 






Phe Leu Leu 


Leu 


Ser 


Leu 


Thr 


Val 


Phe 


Ser He Val 


Ser 


Val 


Thr 


Ala 


35 










40 






45 








Tyr lie Ala 


Leu 


Ala 


Leu 


Leu 


Ser 


Val 


Thr He Ser 


Phe 


Arg 


He 


Tyr 


50 








55 






60 










Lys Gly Val 


lie 


Gin 


Ala 


lie 


Gin 


Lys 


Ser Asp Glu 


Gly 


His 


Pro 


Phe 


65 






70 








75 








80 


Arg Ala Tyr 


Leu 


Glu 


Ser 


Glu 


Val 


Ala 


He Ser Glu 


Glu 


Leu 


Val 


Gin 




85 










90 






95 




Lys Tyr Ser 


Asn 


Ser 


Ala 


Leu 


Gly 


His 


Val Asn Cys 


Thr 


He 


Lys 


Glu 




100 










105 






110 






Leu Arg Arg 


Leu 


Phe 


Leu 


Val 


Asp 


Asp 


Leu Val Asp 


Ser 


Leu 


Lys 


Phe 


115 










120 






125 








Ala Val Leu 


Met 


Trp 


Val 


Phe 


Thr 


Tyr 


Val Gly Ala 


Leu 


Phe 


Asn 


Gly 


130 






135 






140 










Leu Thr Leu 


Leu 


lie 


Leu 


Ala 


Leu 


He 


Ser Leu Phe 


Ser 


Val 


Pro 


Val 


145 






150 








155 








160 


He Tyr Glu 


Arg 


His 


Gin 


Ala 


Gin 


He 


Asp His Tyr 


Leu Gly Leu Ala 


165 










170 






175 




Asn Lys Asn 


Val 


Lys 


Asp 


Ala 


Met 


Ala 


Lys He Gin 


Ala 


Lys 


He 


Pro 


180 








185 






190 






Gly Leu Lys 


Arg 


Lys 


Ala 


Glu 

















195 



12) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE : 

(A) LIBRARY: 

(B) CLONE: Consensus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GGTTTGTGCA GTTACAGCTT TTCTNTTGGT ATGCATAATT AATANTTGGA GCTGCAAAGA 60 
GATCGTGACA AGAGATGGAC GGTCAGAAGA AAAATTGGAA GGACAAGGTT GTTGACCTCC 120 
TGTACTGGAG AGACATTAAG AAGACTGGAG TGGTGTTTGG TGCCAGCCTA TTCCTGCTGC 180 
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TTTCATTGAC AGTATTCAGC ATTGTGAGCG TAACAGCCTA CATTGCCTTG GCCCTGCTCT 240 

CTGTGACCAT CAGCTTTAGG ATATACAAGG GTGTGATCCA AGCTATCCAG AAATCAGATG 300 

AAGGCCACCC ATTCAGGGCA TATCTGGAAT CTGAAGTTGC TATATCTGAG GAGTTGGTTC 360 

AGAAGTACAG TAATTCTGCT CTTGGTCATG TGAACTGCAC GATAAAGGAA CTCAGGCGCC 420 

TCTTCTTAGT TGATGATTTA GTTGATTCTC TGAAGTTTGC AGTGTTGATG TGGGTATTTA 480 

CCTATGTTGG TGCCTTGTTT AATGGTCTGA CACTACTGAT TTTGGCTCTC ATTTCACTCT 540 

TCAGTGTTCC TGTTATTTAT GAACGGCATC AGGCACAGAT AGATCATTAT CTAGGACTTG 600 

CAAATAAGAA TGTTAAAGAT GCTATGGCTA AAATCCAAGC AAAAATCCCT GGATTGAAGC 660 

GCAAAGCTGA ATGAAAACGC CCAAAATAAT TAGTAGGAGT TCATCTTTAA AGGGGATATT 720 

CATTTGATTA TACGGGGGAG GGTCAGGGAA GAACGACCTT GACGTTGCAG TGCAGTTTCA 780 

CAGATCGTTG TTAGATCTT 799 

(2) INFORMATION FOR SEQ ID NO: 3; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: THP1NOB01 

(B) CLONE: 31870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met 


Ala 


Glu 


Arg 


Xaa 


Ala 


Ala 


Thr 


Gin 


Ser 


His 


Ser He 


Ser 


Ser Ser 


1 








5 










10 








15 


Ser 


Phe 


Gly 


Ala 


Glu 


Pro 


Ser 


Ala 


Pro 


Gly Gly 


Gly Gly Ser 


Pro Gly 








20 










25 








30 


Ala 


Cys 


Pro 


Ala 


Leu 


Gly 


Thr 


Lys 


Ser 


Cys 


Ser 


Ser Ser Cys 


Ala Val 






35 










40 








45 






His 


Asp 


Leu 


He 


Xaa 


Trp 


Arg 


Asp 


Val 


Lys 


Lys 


Thr Gly 


Phe 


Val Phe 




50 










55 










60 






Gly 


Thr 


Thr 


Leu 


He 


Met 


Leu 


Leu 


Ser 


Leu 


Ala 


Ala Phe 


Ser 


Val He 


65 










70 










75 






BO 


Ser 


Val 


Val 


Ser 


Tyr 


Leu 


He 


Leu 


Ala 


Leu 


Leu 


Ser Val 


Thr 


He Ser 










85 










90 








95 


Phe 


Arg 


He 


Tyr 


Lys 


Ser 


Val 


He 


Gin 


Ala 


Val 


Gin Lys 


Ser 


Glu Glu 








100 










105 






110 




Gly 


His 


Pro 


Phe 


Lys 


Ala 


Tyr 


Leu 


Asp 


Val Asp 


He Thr 


Leu 


Ser Ser 






115 










120 








125 






Glu 


Ala 


Phe 


His 


Asn 


Tyr 


Met 


Asn 


Ala 


Ala 


Met 


Val His 


He 


Asn Arg 




130 










135 










140 




Ala 


Leu 


Lys 


Leu 


He 


He 


Arg 


Leu 


Phe 


Leu 


Val 


Glu Asp Leu Val Asp 


145 










150 










155 






160 


Ser 


Leu 


Lys 


Leu 


Ala 


Val 


Phe 


Met 


Trp 


Leu 


Met 


Thr Tyr Val 


Gly Ala 










165 










170 








175 


Val 


Phe 


Asn 


Gly 


He 


Thr 


Leu 


Leu 


He 


Leu 


Ala 


Glu Leu 


Leu 


He Xaa 








180 










185 








190 




Ser 


Val 


Pro 


He 


Val 


Tyr 


Xaa 


Lys 


Tyr 


Lys 


Val 


Pro Ser 


Lys 


Thr Pro 






195 










200 








205 




Trp 


Asn 


Arg 


Gin 


Lys 


Lys 


Gly 


Arg 


He 


Ser 


Thr 


Trp Lys 


Pro 


Glu Met 




210 










215 










220 






Gin 


Gin 


Leu 


Leu 


Lys 


His 


His 


Leu 


He 


Val 


He 


Thr Ser 


Leu 


Leu Val 


225 










230 










235 






240 



Leu 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 
(A) LIBRARY: THP1NOB01 
(B> CLONE: 31870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

ACACNAGCGN NTCGNGCTCC CGAACCTCTA GCTGCGACTC GGANTGAGTC AGTCAGTCTG 60 

TCGGAGTCTG TCCTCGGAGC AGGCGGAGTA AAGGGACTTG AGCGAGCCAG TTGCCGGATT 120 

ATTCTATTTC CCCTCCCTCT CTCCCGCCCC GTATCTCTTT TCATTTTNNT NCCACCCTTG 180 

CTCGCGTANC ATGGCGGAGC GTNCGGCGGC CACTCAGTCC CATTCCATCT CCTCGTCGTC 240 

CTTCGGAGCC GAGCCGTCCG CGCCCGGCGG CGGCGGGAGC CCAGGAGCCT GCCCCGCCCT 300 

GGGGACGAAG AGCTGCAGCT CCTCCTGTGC GGTGCACGAT CTGATTTTMT GGAGAGATGT 360 

GAAGAAGACT GGGTTTGTCT TTGGCACCAC GCTGATCATG CTGCTTTCCC TGGCAGCTTT 420 

CAGTGTCATC AGTGTGGTTT CTTACCTCAT CCTGGCTCTT CTCTCTGTCA CCATCAGCTT 480 

CAGGATCTAC AAGTCCGTCA TCCAAGCTGT ACAGAAGTCA GAAGAAGGCC ATCCATTCAA - 540 

AGCCTACCTG GACGTAGACA TTACTCTGTC CTCAGAAGCT TTCCATAATT ACATGAATGC 600 

TGCCATGGTG CACATCAACA GGGCCCTGAA ACTCATTATT CGTCTCTTTC TGGTAGAAGA 660 

TCTGGTTGAC TCCTTGAAGC TGGCTGTCTT CATGTGGCTG ATGACCTATG TTGGTGCTGT 720 

TTTTAACGGA ATCACCCTTC TAATTCTTGC TGAACTGCTC ATTTTNAGTG TCCCGATTGT 780 

NTATNAGAAG TACAAGGTTC CAAGCAAAAC TCCCTGGAAT CGCCAAAAAA AAGGCAGAAT 840 

AAGTACATGG AAACCAGAAA TGCAACAGTT ACTAAAACAC CATTTAATAG TTATAACGTC 900 

GTTACTTGTA CTATGAAGGA AAATACTCAG TGTCAGCTTG AGCCTGCATT CCAAGCTTTT 960 

TTTTTAATTT GGTGGTTTTC TCCCATCCTT TCCCTTTAAC CCTCAGTNTC AAGCACAAAN 1020 

TTTNATGGAC TGATAANNGA" TCTATNTTAG ANCTCAGAAG ANGANAGNTT CANNTGCATA 1080 

GGNTAAGGNA NTACC 1095 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 307307 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Met 


Ala 


Ala 


Pro 


Gly Asp 


Pro 


Gin 


Asp 


Glu Leu Leu Pro Leu Ala Gly 


1 








5 








10 15 


Pro 


Gly 


Ser 


Gin 


Trp Leu 


Arg 


His 


Arg 


Gly Glu Gly Glu Asn Glu Ala 








20 








25 


30 


Val 


Thr 


Pro 


Lys 


Gly Ala 


Thr 


Pro 


Ala 


Pro Gin Ala Gly Glu Pro Ser 






35 








40 




45 


Pro 


Gly 


Leu 


Gly 


Ala Arg 


Ala 


Arg 


Glu 


Ala Ala Ser Arg Glu Ala Gly 




50 








55 






60 


Ser 


Gly 


Pro 


Ala 


Arg Gin 


Ser 


Pro 


Val 


Ala Met Glu Thr Ala Ser Thr 


65 








70 








75 80 


Gly 


Val 


Ala 


Gly 


Val Ser 


Ser 


Ala 


Met 


Asp His Thr Phe Ser Thr Thr 



85 90 95 
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Ser Lys Asp Gly Glu Gly Ser Cys Tyr 
100 105 
Cys Tyr Pro Pro Gin Glu Asp Ser Thr 

115 120 
Lys Glu Asn Gly His Val Thr lie Ser 

130 135 
Thr Pro Gly Pro Ser Leu Pro Asp Val 
145 150 
Leu Phe Ser Ser Asp Ser Gly He Glu 
165 

Glu Val Asn Lys He Leu Ala Asp Pro 
180 185 
Ala Tyr Lys Tyr He Asp He Thr Arg 

195 200 
Glu Gin His Kis Pro Glu Leu Glu Asp 

210 215 
Lys Asp Thr Asp He Ser He Lys Pro 
225 230 
Lys Pro Ala Pro Val Glu Gly Lys He 
245 

Glu Ser Thr Phe Ala Pro Tyr He Asp 
260 265 
Arg Ala Pro Gin He Thr Thr Pro Val 

275 280 
Glu Pro Ser Val Glu Thr Thr Thr Gin 

290 295 
Asp He Cys Leu Lys Pro Ser Pro Asp 
305 310 
Ser Glu Pro Glu Asp Asp Ser Pro Gly 
325 

Gly Thr Glu Pro Ser Ala Ala Glu Ser 
340 345 
Glu Asp Glu Leu He Thr Ala He Lys 

355 360 
Glu Thr Ala Glu Asn Pro Arg Pro Val 

370 375 
Glu Val Lys Ala Arg Ser Gly Pro Pro 
385 390 
His Glu Ala Ser Ser Ala Glu Ser Gly 
405 

Ser Glu Asd Pro Met Ala Ala Glu Asp 
420 425 
Ser Phe Gly His Val Gly Gly Pro Pro 

435 440 
He Gin Tyr Ser He Leu Arg Glu Glu 

450 455 
Glu Leu He lie Glu Ser Cys Asp Ala 
465 470 
Pro Lys Arg Glu Gin Asp Ser Pro Pro 
485 

Ala He Arg Glu Glu Thr Gly Val Arg 
500 505 
Arg Arg Gly Leu Ala Glu Pro Gly Ser 

515 520 
Glu Pro Gin Pro Gly Pro Glu Leu Pro 

530 535 
Pro Glu Thr Pro Met Leu Pro Arg Lys 
545 550 
Asn Gin Ser Pro Ala Ala Thr Lys Gly 
565 

Ala Pro Pro Pro Leu Leu Phe Leu Asn 
580 585 



Thr Ser Leu He Ser Asp He 
110 

Tyr Phe Thr Gly He Leu Gin 
125 

Glu Ser Pro Glu Glu Leu Gly 
140 

Pro Gly He Glu Ser Arg Gly 
155 160 
Met Thr Pro Ala Glu Ser Thr 
170 175 
Leu Asp Gin Met Lys Ala Glu 
190 

Pro Glu Glu Val Lys His Gin 
205 

Lys Asp Leu Asp Phe Lys Asn 
220 

Glu Gly Val Arg Glu Pro Asp 
235 240 
He Lys Asp His Leu Leu Glu 
250 255 
Asp Leu Ser Glu Glu Gin Arg 
270 

Lys He Thr Leu Thr Glu He 
285 

Glu Lys Thr Pro Glu Lys Gin 
300 

Thr Val Pro Thr Val Thr Val 
315 320 
Ser He Thr Pro Pro Ser Ser 
330 335 
Gin Gly Lys Gly Ser He Ser 
350 

Glu Ala Lys Gly Leu Ser Tyr 
365 

Gly Gin Leu Ala Asp Arg Pro 
380 

Thr He Pro Ser Pro Leu Asp 
395 400 
Asp Ser Glu lie Glu Leu Val 
410 415 
Ala Leu Pro Ser Gly Tyr Val 
430 

Pro Ser Pro Ala Ser Pro Ser 
445 

Arg Glu Ala Giu Leu Asp Ser 
460 

Ser Ser Ala Ser Glu Glu Ser 
475 480 
Met Lys Pro Ser Ala Leu Asp 
490 495 
Ala Glu Glu Arg Ala Pro Ser 
510 

Phe Leu Asp Tyr Pro Ser Thr 
525 

Pro Gly Asp Gly Ala Leu Glu 
540 

Pro Glu Glu Asp Ser Ser Ser 
555 560 
Pro Gly Pro Leu Gly Pro Gly 
570 575 
Lys Gin Lys Ala lie Asp Leu 
590 
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Leu 


Tyr 


Tro 
59*5 


Arg 


Asp 


lie 


Lys 


Gin 
600 


Thr 


Gly 


11 


Val 


Ph 
605 


Gly 


Ser 


Phe 


Leu 


Leu 
610 


Leu 


Leu 


Phe 


Ser 


Leu 
615 


Thr 


Gin 


Phe 


Ser 


Val 
620 


Val 


Ser 


Val 


Val 


Ala 


Tyr 


Leu 


Ala 


Leu 


Ala 


Ala 


Leu 


Ser 


Ala 


Thr 


He 


Ser 


Phe 


Arg 


He 


625 








630 










635 










640 


Tyr 


Lys 


Ser 


Val 


Leu 


Gin 


Ala 


Val 


Gin 


Lys 


Thr Asp 


Glu 


Gly 


His 


Pro 








645 










650 










655 




Phe 


Lys 


Ala 


Tyr 


Leu 


Glu 


Leu 


Glu 


He 


Thr 


Leu 


Ser 


Gin 


Glu 


Gin 


He 






660 










665 










670 






Gin 


Lys 


Tyr 


Thr 


Asp 


Cys 


Leu 


Gin 


Phe 


Tyr 


Val 


Asn 


Ser 


Thr 


Leu Lys 




675 










660 










685 








Glu 


Leu 
690 


Arg 


Arg 


Leu 


Phe 


Leu 
695 


Val 


Gin 


Asp 


Leu 


Val 
700 


Asp 


Ser 


Leu 


Lys 


Phe 


Ala 


Val 


Leu 


Met 


Trp 


Leu 


Leu 


Thr 


Tyr 


Val 


Gly 


Ala 


Leu 


Phe 


Asn 


705 










710 








715 










720 


Gly 


Leu 


Thr 


Leu 


Leu 


Leu 


Met 


Ala 


Val 


Val 


Ser 


Met 


Phe 


Thr 


Leu 


Pro 








725 










730 










735 




Val 


Val 


Tyr 


Val 


Lys 


His 


Gin 


Ala 


Gin 


He 


Asp 


Gin 


Tyr 


Leu 


Gly Leu 








740 










745 










750 






Val 


Arg 


Thr 
755 


His 


He 


Asn 


Ala 


Val 
760 


Val 


Ala 


Lys 


He 


Gin 
765 


Ala 


Lys 


He 


Pro 


Gly 
770 


Ala 


Lys 


Arg 


His 


Ala 
775 


Glu 



















(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 307309 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



Met 


Ala 


Ala 


Glu 


Asp 


Ala 


Leu 


Pro 


Ser 


Gly 


Tyr 


Val 


Ser Phe Gly 


His 


1 








5 










10 






15 




Val 


Gly 


Gly 


Pro 


Pro 


Pro 


Ser 


Pro 


Ala 


Ser 


Pro 


Ser 


He Gin Tyr 


Ser 






20 










25 








30 




He 


Leu 


Arg 


Glu 


Glu 


Arg 


Glu 


Ala 


Glu 


Leu 


Asp 


Ser 


Glu Leu He 


He 






35 










40 








45 




Glu 


Ser 


Cys 


Asp 


Ala 


Ser 


Ser 


Ala 


Ser 


Glu 


Glu 


Ser 


Pro Lys Arg 


Glu 




50 










55 










60 




Gin Asp 


Ser 


Pro 


Pro 


Met 


Lys 


Pro 


Ser 


Ala 


Leu 


Asp 


Ala He Arg 


Glu 


65 










70 










75 




80 


Glu 


Thr 


Gly 


Val 


Arg 


Ala 


Glu 


Glu 


Arg 


Ala 


Pro 


Ser Arg Arg Gly 


Leu 










85 










90 






95 




Ala 


Glu 


Pro 


Gly 


Ser 


Phe 


Leu 


Asp 


Tyr 


Pro 


Ser 


Thr 


Glu Pro Gin 


Pro 








100 










105 








110 




Gly 


Pro 


Glu 


Leu 


Pro 


Pro 


Gly 


Asp 


Gly 


Ala 


Leu 


Glu 


Pro Glu Thr 


Pro 




115 








120 








125 




Met 


Leu 


Pro 


Arg 


Lys 


Pro 


Glu 


Glu 


Asp 


Ser 


Ser 


Ser 


Asn Gin Ser 


Pro 




130 






135 








140 






Ala 


Ala 


Thr 


Lys 


Gly 


Pro 


Gly 


Pro 


Leu 


Gly 


Pro 


Gly Ala Pro Pro 


Pro 


145 










150 










155 






160 


Leu 


Leu 


Phe 


Leu 


Asn 


Lys 


Gin 


Lys 


Ala 


lie 


Asp 


Leu 


Leu Tyr Trp 


Arg 
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165 



Asp 


He 


Lys 


Gin 


Thr Gly He 


Val 








180 






Phe 


Ser 


Leu 


Thr 


Gin Phe Ser 


Val 






195 






200 


Leu 


Ala 


Ala 


Leu 


Ser Ala Thr 


He 




210 






215 




Leu 


Gin 


Ala 


Val 


Gin Lys Thr 


Asp 


225 








230 


Leu 


Glu 


Leu 


Glu 


He Thr Leu 


Ser 










245 




Asp Cys 


Leu 


Gin 


Phe Tyr Val 


Asn 








260 






Leu 


Phe 


Leu 


Val 


Gin Asp Leu 


Val 






275 




280 


Met 


Trp Leu 


Leu 


Thr Tyr Val 


Gly 




290 






295 


Leu 


Leu 


Met 


Ala 


Val Val Ser 


Met 


305 








310 




Lys 


His 


Gin 


Ala 


Gin lie Asp 


Gin 










325 




He 


Asn 


Ala 


Val 


Val Ala Lys 


He 








340 




Arg 


His 


Ala 


Glu 










355 









170 175 



Phe 


Gly Ser 


Phe 


Leu 


Leu Leu Leu 


185 










190 


Val 


Ser 


Val 


Val 


Ala 


Tyr Leu Ala 










205 




Ser 


Phe 


Arg 


He 


Tyr 


Lys Ser Val 








220 






Glu 


Gly 


His 


Pro 


Phe 


Lys Ala Tyr 






235 






240 


Gin 


Glu 


Gin 


He 


Gin 


Lys Tyr Thr 




250 








255 


Ser 


Thr 


Leu 


Lys 


Glu 


Leu Arg Arg 


265 










270 


Asp 


Ser 


Leu 


Lys 


Phe 


Ala Val Leu 










285 




Ala 


Leu 


Phe 


Asn 


Gly Leu Thr Leu 








300 






Phe 


Thr 


Leu 


Pro 


Val 


Val Tyr Val 






315 






320 


Tyr 


Leu 


Gly Leu 


Val 


Arg Thr His 




330 








335 


Gin 


Ala 


Lys 


He 


Pro Gly Ala Lys 


34 5 










350 



(2) INFORMATION FOR SEQ ID NO: 7: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 307311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met 


Gin 


Ala 


Thr 


1 








Trp 


Lys 


Ser 


Gin 








20 


Thr Gly 


He 


Val 






35 




Gin 


Phe 


Ser 


Val 




50 






Ser Ala 


Thr 


lie 


65 








Gin 


Lys 


Thr 


Asp 


He 


Thr 


Leu 


Ser 








100 


Phe 


Tyr 


Val 


Asn 






115 




Gin 


Asp 


Leu 


Val 




130 






Thr 


Tyr 


Val 


Gly 


145 






Val 


Val 


Ser 


Met 



Ala 


Asp 


Ser 


Thr 


5 








Ala 


He 


Asp 


Leu 


Phe 


Gly 


Ser 


Phe 








40 


Val 


Ser 


Val 


Val 






55 




Ser 


Phe 


Arg 


He 




70 






Glu 


Gly 


His 


Pro 


85 








Gin 


Glu 


Gin 


He 


Ser 


Thr 


Leu 


Lys 








120 


Asp 


Ser 


Leu 


Lys 






135 




Ala 


Leu 


Phe 


Asn 




150 






Phe 


Thr 


Leu 


Pro 



Lys 


Met 


Asp Cys 




10 




Leu 




Trp Arg 


25 






Leu 


Leu 


Leu Leu 


Ala 


Tyr 


Leu Ala 






60 


Tyr 


Lys 


Ser Val 






75 


Phe 


Lys 


Ala Tyr 




90 




Gin 


Lys 


Tyr Thr 


105 






Glu 


Leu 


Arg Arg 


Phe 


Ala 


Val Leu 






140 


Gly 


Leu 


Thr Leu 






155 


Val 


Val 


Tyr Val 



Val Trp Ser Asn 
15 

Asp He Lys Gin 
30 

Phe Ser Leu Thr 
45 

Leu Ala Ala Leu 

Leu Gin Ala Val 
80 

Leu Glu Leu Glu 
95 

Asp Cys Leu Gin 
110 

Leu Phe Leu Val 
125 

Met Trp Leu Leu 

Leu Leu Met Ala 
160 

Lys His Gin Ala 
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165 170 175 

Gin lie Asp Gin Tyr Leu Gly Leu Val Arg Thr His He Asn Ala Val 

180 185 190 

Val Ala Lys He Gin Ala Lys lie Pro Gly Ala Lys Arg His Ala Glu 

195 200 205 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS : 
I A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDKESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBanJc 

(B) CLONE: 281046 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Asp Cys Val Trp Ser Asn Trp Lys Ser Gin Ala He Asp Leu Leu 

15 10 15 

Tyr Trp Arg Asp He Lys Gin Thr Gly He Val Phe Gly Ser Phe Leu 

20 25 30 

Leu Leu Leu Phe Ser Leu Thr Gin Phe Ser Val Val Ser Val Val Ala 

35 40 45 

Tyr Leu Ala Leu Ala Ala Leu Ser Ala Thr He Ser Phe Arg He Tyr 

50 55 60 

Lys Ser Val Leu Gin Ala Val Gin Lys Thr Asp Glu Gly His Pro Phe 
65 70 75 80 

Lys Ala Tyr Leu Glu Leu Glu He Thr Leu Ser Gin Glu Gin He Gin 

85 90 95 

Lys Tyr Thr Asp Cys Leu Gin Leu Tyr Val Asn Ser Thr Leu Lys Glu 

100 105 110 

Leu Arg Arg Leu Phe Leu Val Gin Asp Leu Val Asp Ser Leu Lys Phe 

115 120 125 

Ala Val Leu Met Trp Leu Leu Thr Tyr Val Gly Ala Leu Phe Asn Gly 

130 135 140 

Leu Thr Leu Leu Leu Met Ala Val Val Ser Met Phe Thr Leu Pro Val 
145 150 155 160 

Val Tyr Val Lys His Gin Ala Gin Val Asp Gin Tyr Leu Gly Leu Val 

165 170 175 

Arg Thr His He Asn Thr Val Val Ala Lys He Gin Ala Lys He Pro 

180 185 190 

Gly Ala Arg Gly Met Leu Ser Arg Trp Leu Pro Gin Glu Lys Pro Asp 

195 200 205 

Met Asn Gly Gly Val Trp Ser Gly Asn Ser Ser Leu Leu Pro Arg Tyr 

210 215 220 

Cys Glu Leu He Val Ser Leu Pro Gin Tyr His Asn Leu Arg Gly Lys 
225 230 235 240 

Leu Arg Asp Arg Cys Phe Gin Ser Phe Pro Val Leu Leu Gly Tyr Leu 

245 250 255 

Ser Pro Pro Arg Pro Leu Ser Ser Thr Lys Val 
260 265 

(2) INFORMATION FOR SEQ ID NO: 9: 

{i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single f 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: SPLNFET01 

(B) CLONE: 28742 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CCTATNCCNG CTGCTTTCAT TGACAGTATT CAGCATTGTG AGCGTAACAG CCTACATTGC 

CTTNGCCCTG CNCTCTGTGA CCATCAGCTN TAGGCTATAC AAGGGTGTGA TCCAAGCTAT 

CCAGAAATCA GATGAAGGNC ACCCATTCAG GGCATATCTG GANTCTGAAG TTGCTATATC 

TGAGGAGTTG NTTCAGAAGT ACACGTAAAT NNTGNNCNTG GTCAATGTGA NCTCCACGNC 
TAANGGANCT CAGGTGCCTA T 
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CLAIMS 

1. A substantially purified human NSPLP protein comprising the 
amin acid sequence of SEQ ID N0:1 or fragments thereof. 

2. An isolated and purified polynucleotide sequence encoding a 
protein of claim 1. 

3. An isolated and purified polynucleotide sequence of claim 2 
consisting of the sequence of SEQ ID NO: 2 or degenerate variants thereof. 

4. A polynucleotide sequence fully complementary to the sequence 
of SEQ ID NO: 2 or degenerate variants thereof. 

5. An isolated and purified polynucleotide sequence of claim 2 
consisting of a polynucleotide sequence that hybridizes under stringent 
hybridization conditions to the sequence of SEQ ID N0:2. 

6. A recombinant expression vector containing a polynucleotide 
sequence of claim 2. 

7. A recombinant host cell comprising a polynucleotide sequence of 
claim 2. 

8. A method for producing a polypeptide comprising the amino acid 
sequence shown in SEQ ID N0:1, the method comprising the steps of: 

a) culturing the host cell of Claim 7 under conditions 
suitable for the expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

9. A recombinant expression vector containing a polynucleotide 
sequence of claim 5. 

10. A recombinant host cell comprising a polynucleotide sequence 

of claim 9. 

11. A pharmaceutical composition comprising a substantially 
purified human NSPLP protein (SEQ ID N0:1) in conjunction with a suitable 
pharmaceutical carrier. 

12. A purified antibody which binds specifically to a polypeptide 
of claim 1. 

13. A purified antagonist which specifically blocks or reduces the 
activity of the polypeptide of claim 1. 

14. A pharmaceutical composition comprising a substantially 
purified antagonist of the polypeptide of claim 1 in conjunction with a 
suitable pharmaceutical carrier. 

15. A substantially purified human NSPLP protein comprising the 
amino acid sequence of SEQ ID N0:3 or fragments thereof. 

16. An isolated and purified polynucleotide sequence encoding a 
protein of claim 15. 

17. An isolated and purified polynucleotide sequence of claim 16 
consisting of the sequence of SEQ ID NO: 4 or degenerate variants thereof. 

18. A polynucleotide sequence fully complementary to the sequence 
of SEQ ID NO: 4 or degenerate variants thereof. 
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19. An isolated and purified polynucleotide sequence of claim 16 
consisting of a polynucleotide sequence that hybridizes under stringent 
hybridization conditions to the sequence of SEQ ID NO: 4. 

20. A recombinant expression vector containing a polynucleotide 
sequence of claim 16. 

21. A recombinant host cell comprising a polynucleotide sequence 
of claim 16 t . 

22. A method for producing a polypeptide comprising the amino acid 
sequence shown in SEQ ID NO: 3, the method comprising the steps of: 

a) culturing the host cell of Claim 21 under conditions 
suitable for the expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

23. A recombinant expression vector containing a polynucleotide 
sequence of claim 19. 

24. A recombinant host cell comprising a polynucleotide sequence 
of claim 23. 

25. A pharmaceutical composition comprising a substantially 
purified human NSPLP protein (SEQ ID NO: 3) in conjunction with a suitable 
pharmaceutical carrier. 

26. A purified antibody which binds specifically to a polypeptide 
of claim 15. 

27. A purified antagonist which specifically blocks or reduces the 
activity of the polypeptide of claim 15. 

28. A pharmaceutical composition comprising a substantially 
purified antagonist of the polypeptide of claim 15 in conjunction with a 
suitable pharmaceutical carrier. 
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